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A Vision of the Growth Process™ 


By ARNOLD C. HARBERGER * 


One of the great pleasures of belonging to 
j generation of economists is that we were 


Presidential Address delivered at the one-hundred 
meeting of the American Economic Association, 
ary 4, 1998, Chicago, IL. 
Department of Economics, University of California, 
Angeles, CA 90095. This paper is but one step in a 
uence of writings and other presentations on the pro- 
af growth. In this work I have been greatly helped by 
jurnber of research assistants — Luis Alvarado and 
fald Beyer in the early stages, Alfonso Guerra and 
pr Robles in the middle phase, and Enrique Flores and 
nardo Torre in the final stage, as this paper was being 
ared. Readers will note that this paper also draws on 
D. dissertation work done by Beyer, Robles, and Torre. 
the course of the evolution just referred to, I have made 
‘sentations at, and received valuable comments from, a 
mber of different forums—-seminars at Clemson Uni- 
sity, Stanford University, Texas A&M University, and 
FLA in the United States, plus several abroad—the 
tholic University of Chile, the University of Chile and 
Centro de Estudios Público (CEP) in Santiago, the 
nter for Argentine Macroeconomic Studies (CEMA), 
> Instituto Torcuato Di Tella, the Universidad de San 
drés, and the Instituto Superior de Economistas del Go- 
imo (ISEG) in Argentina, as well as the Instituto Tec- 
logico Autónomo de Mexico (ITAM) in Mexico. In 
dition, there were conference presentations organized 
Cornell University, the East-West Center (University 
Hawaii), the Association of Asian Economists (Kuala 
mpur, 1996), the Westem Economic Association (Se- 
le, 1997), the Econometric Society (Latin American 
letings, Santiago, 1997), and the Argentine Political 
onomy Association (Bahia Blanca, 1997). 
I want to make special note of a conference on capital 
mation and economic growth, organized by Michael J. 
skin and held at the Hoover Institution in October 1997. 


: conference was attended by a veritable galaxy of se- 





lus students of growth, a fact which motivated me to 
ite a very long paper (Harberger, 1998) probing issues 
methodology in the measurement and analysis of 
bwth, as well as the more substantive matters empha- 
ed in the present paper. I take this opportunity to refer 
eke to that paper (forthcoming in a volume edited by 
skin) for more detailed treatment of methodological is- 
ps. I also want to give special thanks to Zvi.Griliches, 
ile W. Jorgenson, and Paul Romer, as well as to my 
leagues Sebastian Edwards, Jean-Laurent Rosenthal, 
u Carlos Végh, each of whom gave that paper a very 
eful reading and provided me with extremely useful 

ments. Finally, I want to give special thanks to 
rraine Grams, who has performed miracles translating 
dwritten scrawls faxed from four continents into the 
ex you see before you. 





able to witness the birth and the subsequent . 
evolution of the modern approach to the 
analysis of economic growth. The center- 
piece of that approach is probably growth 
accounting, but we should never forget that 
growth accounting is firmly rooted in eco- 
nomic theory. 

My way of telling the story goes like this: 
Many, maybe even most, economists expected 
that increments of output would be explained 
by increments of inputs, but when we took our 
best shot we found that traditional inputs typ- ` 
ically fell far short of explaining the observed 
cutput growth. Our best shot consisted in at- 
tributing to each factor a marginal product 
measured by its economic reward. Thus: 


(1) pAy=WAL+(P+SAKH+R. 


Here: 


Ay = change in output (GDP); 
AL = change in labor input; 

p = initial general price level; 

w = initial real wage; 

p = initial real rate of return to capital; 

6 = rate of real depreciation of capital; 
AK = change in capital stock; and 

R = ‘‘the residual’’ of growth unexplained 

by increases in traditional inputs. 


Many economists are probably more famil- 
iar with a variant of (1) 


(1°)  (Ay/y) = (WL /py)(AL/L) 
+ [ (~ + 6) K/py ] (AK/K) 
+ (R/y) = se(AL/L) 
+ 5,(AK/K) + (R/y). 

In whichever form, the measured residual 
typically accounted for an important fraction 


of the observed output growth, quite often half 
or more. 


bo 


This result came as a surprise to the profes- 
sion, though perhaps less so to those who 
reached it, or something very like it, by an al- 
ternative route. They were the people who 
came at the problem out of a tradition of mea- 
suring labor productivity, and at some point 
complemented output per worker with a mea- 
sure of output per unit of capital, and finally 
joined the two to create a measure of total fac- 
tor productivity (TFP). The idea of total factor 
productivity increasing through time was less 
a shock to these people than the “‘growth re- 
sidual’’ was to those who approached its mea- 
surement along the lines of equation (1) or 
(1’). See Moses Abramovitz (1952, 1956) 
and Solomon Fabricant (1954). 

In any case, as the newly discovered residual 
loomed large in our professional thinking, our 
discussion centered on two potential explana- 
tions: ‘human capital’ and ‘‘technical ad- 
vance.” (See Robert M. Solow, 1957.) These 
can be thought of as complementary explana- 
tions, at least up to a point, with technical ad- 
vance representing truly new ways of doing 
things, and the accumulation of ‘“‘human capi- 
tal” representing increases in the “‘quality”’ of 
the typical human agent. It was not long before 
attempts were made to quantify the contribution 
of improved labor quality. These came as part 
of a general move toward disaggregation of the 
two factors, which can be represented by: 


(2) pay = 2w,AL; 
+ (p; + 5) AK; + R’. 


Here the index i can vary over all sorts of ed- 
ucation and skill groups as well as categories 
like gender, age, occupation, region, etc. All 
these are items that may signal a different mar- 
ket wage. In a similar vein, the index j would 
appropriately vary over categories like the cor- 
porate, noncorporate, and housing sectors 
where, for tax if for no other reasons, different 
~ (gross-of-tax) rates of return would presum- 
ably prevail, even in a full equilibrium. 

In an equation like (2), the presumed mar- 
ginal product of each category of labor is mea- 
sured by the wage w;. Average quality can be 
measured by Q, = 2, Ww; Lir/ 2i Lir, and the con- 
tribution of change in quality to Ay, between 
tand t+ 1 can be calculated as Lw, (A Q, + 1/ 
Q,). Thus, the contribution of quality change 
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is already built into the first summation in (2 elopmen 
but can be separately identified if we s nical : 
choose. I 
A focus on'human capital could lead us {e qualit 
a slightly different way of breaking dovputions O: 
Ew;AL;. Here we could choose some ‘basi xternalit 
wage” w*, ideally the wage of some we ale, spl 
defined category of relatively unskilled labo ities. A 
Then we could divide the remuneration w; ind, ma 
any given category into a part w* which w CEO, 0 
a reward for ‘‘raw labor’ and another paj I think 
(w; — w*) which we would identify as thcterize } 
reward to the human capital of a typicyaise of 
worker of type i. abel for 
Using a framework like (2) has long bogerlving 
the standard for careful professionals. Pitay we | 
neered by Zvi Griliches (1960, 1963), it w . They : 
utilized by Edward F. Denison (1967). ant- Thin 
John W. Kendrick (1973, 1976, 1977), amo as cert: 
others. This approach has been further Eda to. 
oped and carried to a high art by Dale Wiexities 
Jorgenson and Griliches (1967), Jorgenson gomic £ 
al. (1987), and Jorgenson (1995). ' Let m 
The main point to be made here is that ondled. In 
the residual is measured using a framewo R l 
like (2) or its equivalent, the direct, measurdess exe 
contribution of human capital is captured gome P< 
the labor term EW; AL; . By direct contributi one 1n 
I mean what people are paid for. Doctors e ath to 
more than nurses, and engineers more th ense ag 
draftsmen. These and similar differences afims th 
captured in Zw,AL;, which can be posi with 


even if EAL; is zero, just from an upward t at cor 
ounts. 












shuffling of the same labor force. A truly a 
curate measurement of type (2) would capt 
all the subtle differences of quality that exi ent tt 
in a modern labor force and would give ea 
a weight corresponding to the ( gross-of-t 
earnings that demanders are observed to paj Teal c 
We may do this imperfectly, but, in concept ackgrc 
least, the residual R’ as measured by (2) dog esses 
not contain any elements of quality change ¢ oCh 
any direct contributions of human capital parts o 
growth. This is a quite important point for $ °S ne 
permits us to zero in on the residual as repré odels 
senting ‘‘technical change,” ““TFP improv Sreem 
ment,” and ‘‘real cost reduction.” ortin 
There is no analytical reason to prefer of ut me 
of the above three terms over another, in T as cu 
ferring to the residual R’. But I am going a ick ( 
out on a limb to say that a term like ‘‘technic: uce 
change’’ leads most economists to think of isp Mes 
ventions, of the products of research and dey Our 
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first summation in (yelopment (R&D), and of what we might call 
y identified if we gchnical innovations. On the other hand, TFP 
i mprovement, once purged of the changes in 
capital could lead usthe quality of labor and/or the direct contri- 
ray-of-breaking doyutions-of human capital, makes one think of 
ld choose some ‘‘bagxternalities of different kinds —economies of 
e wage of some weczle, spillovers, and systematic complemen- 
atively unskilled labearities. And finally, real cost reduction, to my 
the remuneration w; mind, makes one think like an entrepreneur or 
o a part w* which wa CEO, or a production manager. 
bor” and another px I think it would be perfectly fair to char- 

would identify as thcterize my presentation today as a paean in 
1 capital of a typicsreise of ‘‘real cost reduction’’ as a standard 
a abel for R’. Labels do not change the un- 
like (2) has long.bederlying reality, but they may change the 
ul professionals. Pivay we look at it and the way we think about 
s (1960, 1963), it wł. They also can lead us to understand it bet- 
. Denison (1967) agr. Thinking in terms of real cost reduction 
'3, 1976, 1977), amonas certainly done all this for me, as I have 
has been further devetied to sort out the many puzzles and com- 

high art by Dale Wlexities that surround the process of eco- 
‘s (1967), Jorgenson omic growth. ` 
son (1995). Let me try to take you down the path I trav- 
. made here is that onded. In the first place, real cost reduction 
ed using a framewoRCR) is probably on the mind of most busi- 
at, the direct, measuré€ss executives, production managers, etc., at 

capital is captured ome point or another in any given week, let 
. By direct contributidone in any given month or year. It is a major 
: paid for. Doctors earath to profit in good times, and a major de- 
i engineers more thênse against adversity in bad times. Most U.S. 
similar differences aifms that have downsized in recent years did 
which can be positi with RCR in mind. So, too, did the firms 
tei from an upward raat computerized their payrolls and other ac- 
abor force. A truly aunts. And so also did those who shifted to 
type (2) would captuthat they considered more modern manage- 
es of quality that exifemt techniques. I recall going through a 
le and would give eadothing plant in Central America, where the 
z to the (gross-of-ta*wner informed me of a 20-percent reduction 
zs are observed to paf real costs, following upon his installation of 
ectly, but, in concept @Ckground music that played as the seam- 
: measured by (2) dodfesses worked. And then there is the story of 
its of quality change YO Chilean refrigerator firms that ended up as 
is of human capital tts of a single conglomerate at one point. 

important point for $2 new management reduced the number of 
1 the residual as repri¢déls from something like 24 to two, making 
nge,” “TFP improvements to import other models while ex- 
reduction.” Orting these two. The end result was that out- 
al reason to prefer ortt more than doubled, while the labor force 
ngyover another, in ras cut to less than half, and even the capital 
> But lam going a block (at replacement cost) was significantly 
ta term like ‘‘technic®Cuced. This sounds like (and is really ) econ- 
momists to think of i™ies of scale, but they would not be detected 
ts of research and dð Our usual measures, as both labor force and 
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capital stock went down. And we all have seen 
cases where, say, an office’s real costs were 
reduced when a martinet of a manager was re- 
placed by someone more reasonable. But we 
have also-seen- cases where real-costs. were re- 
duced when a very lax manager was replaced 
by someone more strict. 

It has long been my song that there are at 
least 1001 ways to reduce real costs and that 
most of them are actually followed in one part 
or other of any modern complex economy, 
over any plausible period (say, a decade). 
Once one accepts this proposition as true, the 
question then arises: Why would anybody try 
to settle on just one underlying cause of real 
cost reduction? The answer, I think, is mind- 
set—the framework in which one is thinking 
at the moment. The pioneer writings of the re- 
cent endogenous growth literature can, I think, 
be said to reflect a kind of annoyance at some- 
thing like R or R’ being considered exogenous. 
There was an urge to surmount that inelegance 
by somehow making the residual endogenous. 
And in a simple growth model that meant gen- 
erating a feedback from the rest of the model 
to the residual. A 1001 feedbacks would be out 
of the question, but one feedback would work 
just fine. Thus Paul Romer (1986) focused on 
a feedback through ‘‘knowledge,’’ with the 
stock of knowledge shifting production func- 
tions all over the economy; Robert E. Lucas, 
Jr. (1988) focused on ‘‘human capital,” not 
on its direct and remunerated productivity, but 
on the externalities that each increase in the 
stock of human capital were presumed to gen- 
erate. These single feedbacks achieved the 
limited purpose of endogenizing R or R’ 
within a specified model, but they did not rep- 
resent very well the multifaceted nature of real 
cost reduction as we observe it in actuality. 
And, in point of fact, both the cited authors in 
their more recent writings display a deep rec- 
ognition of the subtlety and complexity of the 
growth process, not really capable of being 
captured through a simple féedback mecha- 
nism. (See Romer, 1990, 1994a, b; Lucas, 
1993.) 

So, real cost reduction is multifaceted and 
everywhere around us. Where does that get us? 
Or how can we get anywhere in the face of 
such complexity? The next step is to recognize 
that in spite of its complexity, real cost reduc- 
tion can be reduced to a single metric, and can 


4 THE AMERICAN ECONOMIC REVIEW 


TABLE 1—GROWTH BREAKDOWN TREATING REAL Cost REDUCTION AS ADDITIVE 


TFP growth Absolute amount 
over period of real cost 
(1.0 = 100 reduction 

Industry percent) 1) x (4] 

(1) (2) 

1 0.800 $80b. 

2 ; 0.600 $120b. 

3 0.500 $150b. 
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peee 
Cum. sum Initial value Cum. ; 
of (2) added of (4) 
(3) (4) ©) 
$80b. ` $100b. $100 
$200b. $200b. s3oquumber é 
$350b. $300b. go0ggailroad: 


All the rest 0.107 $150b. $500b. $1,400b. $2,00gfextile M 
o 


be made additive. For a quick appreciation of 


this, assume that total factor productivity grew- 


by 80 percent in one industry over a decade, 
by 60 percent in another industry, and by 50 
percent in a third. If their initial value added 
amounted to $100 billion, $200 billion, and 
$300 billion, respectively, then the real cost 
reduction of the first was $80 billion, that of 
the second was $120 billion, and that of the 
third $150 billion. So we can say that, mea- 
sured at initial prices, the real cost reduction 
of the three together was $350 billion over the 
decade in question. I truly think that the notion 
of real cost reduction being additive in this 
way came to my mind, and is easily seen by 
others, just as a consequence of the label. The 
idea of additivity does not follow nearly so 
easily from the labels ‘‘technical advance’ 
and ‘‘total factor productivity.” 

Anyway, this vision of the growth process 
opens up many new vistas and gives us many 
new challenges. To me, it gives life to the re- 
sidual, viewed as real cost reduction, in a way 
that remote macroeconomic externalities 
never did. It gives the residual body, in the 
sense that the number of dollars saved by real 
cost reduction is a tangible and measurable 
quantity. It gives the residual a name (real cost 
reduction), an address (the firm), and a face 
(the face~of-the entrepreneur, the--CEO, the 
production manager, etc.) And, finally, we 
shall see that there can be vastly different ex- 
pressions on that face, even as we move from 
firm to firm in a given industry, as the TFP 
experience of a period moves from sharply 
positive to devastatingly negative. 


jranspor 


I. Yeast versus Mushrooms: PartI  pemicai 


Table 1 is based on the numerical sie j 
just given, plus the information that the retoleui 
maining industries (say, in the economy) ti 
gether had an initial value added of $1,4¢tré 
billion and experienced real cost reduction (fining 
$150 billion over the period. Setting out d 
in the format of Table 1 allows us to mafommu 
statements like ‘‘15 percent ($300 b./$2, = 
b.) of the industries (measured by initial val 
added) accounted for 40 percent ($200 here fo 
$500 b.) of the real cost reduction (RCR) he com’ 
the period’ and ‘30 percent ($600 b./$2,00 — 
b.) of the industries accounted for 70 percep org 
($350 b./$500 b.) of the period’s RCR.”  fop 22 

I stumbled on this way of presenting data ge f 
real cost reduction in the course of writingtP 40 
background paper (Harberger, 1990) for owh 1 
World Bank’s World Development Report} i 
1991. Once I saw it, I immediately embraci 
it, because it helped me communicate to oth i 
what I call the ‘‘yeast versus mushrooms” jeduct 
sue. The analogy with yeast and mushroopauses 
comes from the fact that yeast causes bread {enar 
expand very evenly, like a balloon being fillévork r 
with air, while mushrooms have the habit ¢ Pers 
popping up, almost overnight, in a fashion the ‘‘: 
is not easy to predict. I believe that a ‘‘yeastemer 
process fits best with very broad and .generOme ¢ 
externalities, like externalities linked to tent,” 
growth of the total stock of knowledge or gon. J 
human capital, or brought about by economit220’ 
of scale tied to the scale of the economy asies, 1 
whole. A ‘‘mushroom’’ process fits mote 19. 
readily with a vision such as ours, of real contibi 
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TABLE 2—CONCENTRATION OF TFP GrowTH AMONG U.S. INDUSTRIES 1958—1967 


[CoLumns (2) To (5) In BILLIONS oF 1958 DorLars] 
g Absolute amount 
ritial value Cum. xe of real cost Cum. GDP by Cum. 
added of (45 .. ... .. TFP growth over period... reduction sum of industry sum of 
W OE (1.0 = 100 percent) 11) x (4)] (2) 1958 (4) 
Siah aa (1) (2) (3) (4) (5) 
$200b. saoqumber & Wood Products 0.72 2.51 2.51 3.50 3.50 
| $300b. $60qailroad Transport : 0.63 5.52 8.03 8.70 12.20 
| $1,400b. $2,00jertile Mill Products 0.61 2.49 10.52 4.10 16.30 
a jectrical Machinery 0.55 5.10 15.66 9.30 25.60 
j t 
ransport Equipment 0.46 7.05 22.71 15.20 40.80 
Mushrooms: Party bemicals 0.44 3.97 26.68 9.10 49.90 
th ical dblic Utilities 0.42 4.65 31.33 11.00 60.90 
. the numerical exampy 
nformation that the rroleum and Coal 0.41 1.27 32.60 3.10 64.00 
ay, in the economy) & : 
value added of $1 Agee and Products 0.41 1.23 33.83 3.00 67.00 
d Ee pi reductiontning 0.41 5.20 39.03 12.60 79.60 
period. Setting out dë 
e 1 allows us to maommunication 0.40 3.61 42.64 9.00 88.60 
excent ($300 b/$2,08 0.33 24.93 6757 7640 165. 
ieasured by initial vali eon —— 
` 40 percent ($200 here follow 18 more industries 
ost reduction (RCR) combined results of which are 0.03 7.53 75.10 239.80 404.80 
yercent ($600 b./$2,0% 
scou A 70 5 to ptes: Top 10 percent (these percentages are contributions to GDP of industries ranked according to their present rate of 
RESER os p H Ëp growth over period) of industries account for 30 percent of total TFP contribution. 
Ae period’s RCR. bp 22 percent (these percentages are contributions to GDP of industries ranked according to their percent rate of TFP 


Development Report & 
immediately embracă 

2 communicate to othe 

versus mushrooms’ ëductions stemming from 1001 different 

1 yeast and mushroofiuses, though I recognize that one can build 
at yeast causes breadéenarios in which even 1001 causes could 

xe a balloon being filllork rather evenly over the whole economy. 

ooms have the habit §Personally, I have always gravitated toward 





crnight, in < fashicu te “‘mushrooms’’ side of this dichotomy. | 


{ believe that a ‘‘yeastember being impressed, when I first saw 
very broad and geneme early industry estimates of TFP improve- 
ərnalities linked to #tnt, by their tendency to industry concentra- 
ck of knowledge or ðn. For years I told my students that the 
gft about by economi?20’s were the decade of cars and rubber 
le of the economy ases, the 1930’s the decade of refrigerators, 
n’’ process fits mot 1940’s that of pharmaceuticals (especially 
ich as ours, of real c@ltibiotics), and the 1950’s that of television, 





‘ay of presenting data towth over period) of industries account for 52 percent of total TFP contribution. 

the course of writing? 40 percent (these percentages are contributions to GDP of industries ranked according to their percent rate of TFP 
fawth over period) of industries account for 70 percent of total TFP contribution. 

arberger, 1990) for Eres. Kendrick and Grossman (1980). GDP data from U.S. national accounts. 


with telecommunications and computers tak- 
ing over in recent decades. But these were just 
impressions, not based on any systematic ap- 
proach. My real turnaround came in the course 
of writing my 1990 paper, where I presented 
2 series of tables based ou Kendrick and Eilivi 
S. Grossman’s (1980) work. Table 2 is an 
example. . 

Table 2 has the same format as Table 1. Col- 
umn (1) presents the familiar measure of the 
percentage by which TFP grew, or real costs 
were reduced, during the period in question 
(note that the percentages apply to the period 
1958—1967 as a whole; they are not annual 
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rates). To turn these percentages into dollar 
amounts of real cost saving over the period, 
one multiplies them by base-period real GDP 
[col. (4)]. The results are shown in column 
(2). Columns (3) and (5) are the cumulative 
sums of columns (2) and (4), respectively. 
Working with these figures one can make 
statements like those at-the bottom of the 
table—i.e., the top 10 percent of industries 
accounted for 30 percent of total real cost re- 
duction; the top 22 percent of industries ( mea- 
sured by initial value added) accounted for 
more than half of total real cost reduction. 

Readers will notice that at the foot of each 
column in the table is an entry referring to 18 
additional industries, which together accounted 
for only 10 percent of the total TFP contribution, 
while their combined share of initial output was 
almost 60 percent of the total. ; 

Using the analogy with yeast and mushrooms, 
the results of my calculations using the 
Kendrick-Grossman data pointed very clearly to 
a “‘mushrooms’’ interpretation. Not only were 
the contributions to RCR highly concentrated in 
a relatively few industries, these industries also 
were very different as one shifted from decade 
span to decade span. The top four branches in 
percentage of real cost reduction during 1948-— 
1958 were Communications, Public Utilities, 
Farming, and Miscellaneous Manufacturing. In 
1958—1967 they were Lumber, Railroad Trans- 
por, Textile Mills, and Electrical Machinery. In 
1967—1976 they were Finance, Insurance & 
Real Estate, Apparel, Communications, and 
Chemicals. Only Communications appears twice 
among these 12 listings. 

Now to my mind, this already brings 
evidence to bear on a number of possible- hy- 
poiheses concerning the nature of TFP 
improvement. Certainly some ways of inter- 
preting a generalized externality due to im- 
proved education would be hard to justify 
using evidence like this. Strong links of the 
residual term to R&D expenditures! would 
suggest a high degree of persistence among the 
leaders in TFP improvement. So also (proba- 
bly) would economies of scale associated with 


the scale either of the firm or of the industry. -- 


! For a review of the current status of analysis of R&D 
expenditures and their impact on economic growth see 
Griliches (1994). í 
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Such economies are not likely to jump mi; eal. 





around from one industry to the next, from pi Scio 
riod to period. One would expect them to ey 170 
body characteristics of the productive proc 
that would be relatively stable over tim 
hence they should show a reasonably high di w 
gree of persistence, over time, in terms of th” 
TFP experience of particular industries. | 

No economist can look at Table 2 witho; 
thinking of its close analogy with a Lore 
curve. That, indeed, was the next step I tog , 
in trying to represent the degree of concent a 
tion of real cost reduction. Figure 1 (draw 
from Edgar Robles, 1997), shows the qua: 
Lorenz curves for a 20-industry breakdown į 
the U.S. manufacturing sector over four Sut so 
cessive five-year periods. 

What strikes one immediately about Figuj an 
1 is the characteristic “‘overshooting.’’ I hay 
marked with the first vertical line the poi ® 
where the rising curve crosses 100 percent d'atn" 
the vertical axis. The interpretation is that }, ost 
1970—1975 the cumulative real cost reducti 
of just 25 percent of manufacturing industri 
(measured by initial value added) was equ | 
to the total RCR for manufacturing asa whok =, | 4< 
After that there are other industries produci 0 
another 40 percent of the total, but their cot 
tribution is offset by still other industries wit 
negative RCR during the period. 

Corresponding to the 25-percent figure fé 
1975, we have around 12 percent for 1975. 
1980, 48 percent for 1980—1985, and 40 pë 
cent for 1985-1991. These are the fractions to 170 per 


. manufacturing industry which by themselving. In 1° 


were able to account for the full amount of repercent, ir 
cost reduction during the respective period, M 1985- 
manufacturing industry as a whole. (The troub 

The second vertical line in each panel tribution i 
Figure 1 marks the maximum point of th0f the po: 
curve. The interpretation is that about 64 pef that ag 
cent of industries enjoyed real cost reductiolatge, this 
during 1970—1975, with the remaining 36 pe!970-19° 
cent suffering real cost increases (declinis!2 manuf: 
TFP). For the subsequent periods, the corrPefcent o` 
sponding figures are 65(35) percent, 78 (2}f0utrast, t 
percent, and 82(18) percent. Here the first fialmost 1C 


-ure is-the percent of industries enjoying redded in 


cost reductions; the figures in parentheses ref!985-19 
resent those experiencing declining TFP. The pr 

Some interest attaches to the ordinate of t)Pounded 
maximum point on each curve. In the first pitate (int 
riod, TFP growth ended up accounting for clos! be neg 
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y as a whole. The trouble is that when the aggregate TFP con- 
il line in each panel fibution is relatively small, the cumulative total 
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FIGURE 1. PROFILES OF TFP GROWTH AMONG U.S. MANUFACTURING BRANCHES 


to be established to make clear the interpretation 
of Lorenz-like diagrams in such cases. 

I believe I have hit on a felicitous way of 
solving all these problems, and at the same 
time creating an even better, clearer visual rep- 
resentation of the degree of concentration or 
dispersion. of real cost reduction among the 
components of an aggregate. The idea is sim- 
ply to relabel the vertical axis of the Lorenz- 
like diagram, making it represent an annual 
growth rate. For simplicity, think ‘of a 30- 
degree line as representing 1 percent per an- 
num of TFP growth. The rest of the vertical 
axis would be calibrated accordingly. Thus, by 
looking at the slope of a simple chord; we 
could visually assess how. rapid was the TFP 
growth of the aggregate in question. 

Figure 2 is presented simply for didactic 
purposes. Here we have a hypothetical indus- 
trial branch made up of four industries, A, B, 
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FIGURE 2. ILLUSTRATIVE TFP GROWTH PROFILES (SUNRISE-SUNSET DIAGRAMS) 


C, and D. First, we order the industries in de- 
sceñding order, according totheir rates of TFP 
increase in the period. Then we calculate cu- 
mulative real cost reduction (a real dollar 
amount) and plot it against cumulative initial 
real value added. Then we scale the vertical 
axis so as to comply with whatever metric we 
have decided upon for the TFP growth rate (in 


the example, a 30-degree line representing! w 
l-percent annual TFP growth.rate), .and-th 
horizontal axis so as to add up to 100 percen 
In the lower panel of Figure 2 I give ex 
amples to show how these diagrams cope wil 
the problems of a low TFP growth rate (th 
overshoot for the case of 0.25-percent growl 
would show up peaking at over 400 percent i 
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a Lorenz-iype diagram) and of negative TFP 
growth (where it is hard to even conceptualize 
a Lorenz-type picture). 

I first presented these diagrams before a 
large audience at the Western Economic As- 
sociation meetings in Seattle (July 1997), and 
for that presentation coined the label of ‘‘sun- 
rise diagrams’’ on their analogy with the sun 
rising over a hill. That same evening Yoram 
Barzel sugsested that where the argrepate 
slope is negative, we apply the term ‘‘sunset 
diagrams,” which I immediately accepted. 

Figure 3 presents a set of sunrise-sunset 
diagrams based on Jorgenson et al. (1987 pp. 
188-90). These cover 32 industrial sectors 
(their 35 minus Agriculture, Trade, and 
Government Enterprises); I think the utility 
of sunris2-sunset diagrams needs no further 
championing once these pictures are exam- 
ined and digested. Practically all variants are 
represented in these real-world cases: low 
TFP growth with a huge overshoot (1953- 
1957 and 1969-1973); negative growth 
with large and moderate overshoots (1966- 
1969 and 1973-1979); moderate growth 
with small (1979-1985), medium (1960- 
1966), end large (1948—1953) overshoots. 

One striking fact that emerges from this set 
of pictures is how variable across periods is 
the negative contribution of the losers. If the 
losers had only contributed zero change in 
TFP, we would have had cumulative TFP con- 
tributions of about 0.8 percent per annum in 
1948-1953, in 1957-1960, and in 1960- 
1966. And the other periods would not have 
been much different: about 0.7 percent in 
1953-1957 and in 1969-1973, 0.6 percent in 
1966-1969, and 0.5 percent in 1973-1979 
and 1979-1985. Instead of this narrow range 
of cumulative contributions, we have an actual 
distribution that goes from —0.9 percent in 
1973-1979 through around 0.1 percent in 
1953-1957 and 1969~1973 to over 0.5 
percent in 1960—1966 and 1979—1985. 

Does this not suggest that we make a major 
research push trying to improve our under- 
standing of the phenomenon of negative TFP 
growth? What syndromes characterize the firms 
and industries experiencing it? How much of it 
stems from external shocks like international 
prices? How much of it from competition 
within the industry? How much of it represents 
firms struggling to survive, yet experiencing 
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output levels well below their previous peaks} ‘ 
(and presumably below installed capacity)? f ‘ 
How much of it represents things like ‘‘labor Cum. 


I hope that in the previous section I have 
made a convincing case concerning: (a) the} 
usefulness of sunrise-sunset diagranis, (b) ihe; 
aptness of the ‘‘yeast versus mushrooms’’ dì- 
chotomy, and (c) the pervasiveness with 
which the mushroom side of that dichotomy 
seems to come out ahead when the GDP is 
broken down into industries or industrial} 
branches for TFP analysis. The grand design, 
that emerges from the studies reported here,| 
and from just about all the other industry! 
breakdowns that I recall having seen, is that: E 
(i) a small-to-modest fraction of industries 
can account for 100 percent of aggregate real 
cost reduction in a period; (ii) the comple- 
mentary fraction of industries contains win- 
ners and losers, the TFP contributions of: Fic 
which cancel each other; (iii) the losers are 
a very important part of the picture most of 
the time, and contribute greatly to the varia’ luck i 
tions we observe in aggregate TFP pa with F 


hoarding’’ as firms go through periods on Re 
adversity? Cc 
Redu 

I. Yeast versus Mushrooms: Part II | 

Í 


mance; and (iv) there’ is little evidence of} tries t 
persistence from period to period of the lead vance 
ers in TFP performance. as lon. 





The above results are, I think, very in Get 
teresting (in the sense of piquing our curių mits c 
osity), very strong (in terms of their implij patibk 
cations about the nature of the growth} indust 
process), and very robust (in the sense thall exploi 
they have wide applicability over different alread 
data sets analyzed by different authors using; ails j 
at least somewhat different methods). Bu} dustry 
these results, so far, are quite compatible sector 
with what I might call an ‘‘industry view’} only c 
of the TFP story. This is the way I, myself} States 
looked at the growth process until quite cently 
recently—a vision that was reflected in my} Comes 
stories about rubber tires and autos in thé for wk 
1920’s, refrigerators and other household data fr 
appliances in the 1930’s, pharmaceuticals is} fractio 
the 1940’s, etc. The image that I had is} ing da 
mind was one of yeast within each industry} sampl: 
and mushrooms between industries—4 firms, 
commonality of TFP experience by firms withif, so tha: 
an industry, depending on that industry’) branct 
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picture most 0 
itly to the varialuck in the technological draw, side by side 
ate TFP perfotwith highly diverse experience between indus- 
ttle evidence dtries because the distribution of technical ad- 
#iod of the leadvances had wide dispersion, even for periods 
as long as a decade. 

think, very in Getting access to data at the firm level per- 
.quing our curimits one to explore whether this view is com- 
s of their implipatible with the actual experiences of firms and 
of the growtlindustries. We are just in the early stages of this’ 
n the sense théexploration, but I think the result is quite clear 
y over differenalready; namely, the ‘‘mushrooms”’ story pre- 
nt authors usinwails just as much among firms within an in- 
methods). Budustry as it does among industries within a 
uite compatiblsector or broader aggregate. I will present here 
industry view'only a taste of the evidence from the United 

: way I, myseiiStates (on whicń our systematic work just re-— 
ess until quittently got started). Our massive evidence 
reflected in mtomes from the Mexican manufacturing sector, 
nd autos in thfor which Leonardo Torre (1997) has analyzed 
ther householiiata from a sample of over 2,000 firms. A small 
imaceuticals ifraction of these firms were lost owing to miss- 

: that I had iting data, but some 1,900 firms remained in the 
n each industr’ample that Torre finally worked with. These 
industries — firms were divided into 44 branches of industry, 

e by firms withito that on average we have about 43 firms per 
that industry’branch. 
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FIGURE 4. TFP GROWTH PROFILE IN MEXICAN MANUFACTURING SECTOR (1892 ESTABLISHMENTS, 1984-1994) 


There are really too many ways to present 
such a mass of information as is contained in 
Torre’s study. What I will do here is give the 


‘aggregate picture in Figure 4, and then show 


in Figures 5A—C three fast-growing branches, 
three of around median growth, and three from 
among the slowest-growing branches. 

To complement these figures, I finally pres- 
ent, in Figures 6A—D, certain summary statis- 
tics from the sunrise-sunset diagrams of the 44 
branches that Torre studied. Here Figure 6A 
gives the distribution of average rates of TFP 
growth among the 44 industries. Figure 6B 
shows the distribution of peak cumulative con- 
tributions, i.e., what the TFP contribution 


would have been had all the negatives been ` 


zeros: Figure 6C displays the percentile of 
firms (by initial value added ) marking the bor- 
derline between positive and negative TFP 
growth. And finally, Figure 6D shows, for 
branches with positive TFP growth, the per- 
centile of firms which, by themselves, account 
for 100 percent of the industry’s TFP growth. 

This evidence almost seems to replicate, for 
firms within an industry, what was found in 
the previous section for industries within the 
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FIGURE 5A. TFP GROWTH PROFILES FOR FAST-GROWING BRANCHES (MEXICAN MANUFACTURING, 1984-1994) Fic 


economy —rampant overshooting of sunrise- 
sunset diagrams, great influence of firms with 
negative TFP growth in determining the TFP 
‘outcome for an industry, and a small or mod- 
erate fraction of firms accounting for 100 per- 
cent of the TFP growth of an industry (when 


that growth is positive), with the complemen- ` 


tary fraction being winners and losers whose 
efforts end up just offsetting each other. It re- 


we h 
mains to try to give some interpretation toe th 
those results. is by 

ran i 

IH. ‘‘Just Errors” or “It’s a Jungle. of ra 

Out There?” actly 

šuns 

The first question that will enter the minOnly 
of many economists on looking at the eviiot ç 
dence presented so far is: how much of whadr o; 
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3, 1984-1994) Figure 5B. TFP GROWTH PROFILES FoR MEDIUM-GROWING BRANCHES (MEXICAN MANUFACTURING, 1984—1994) 


ve have seen and emphasized might simply 

nterpretation t0e the result of errors of observations? This 
* s by no means a frivolous question. For one 

‘an actually create frequency distributions 

’s a Jungle Xf rates of TFP increase which contain ex- 
ictly the same information as the sunrise- 

unset diagrams previously presented. The 

enter the minOnly trick is to count as the unit of frequency 
cing at the evilot one firm (out of an industry aggregate) 
v much of whar one industry (out of some larger aggre- 


gate) but, instead, say, 1 percent of the total 
value added of the aggregate. Thus a firm 
with 20 percent of the value added of an in- 
dustry would appear with 10 times the 
weight of a firm accounting for 2 percent of 
the value added of that industry. In such a 
chart, the cumulative frequency (say, 68 per- 
cent) above ATFP = 0 would represent the 
projection on the horizontal axis of the 
maximum point on a sunrise diagram. Its 
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Figure 5C. TFP GROWTH PROFILES FOR SLOW -GROWING BRANCHES (MEXICAN MANUFACTURING, 1984—1994) 


complement (32 percent) would represent 
the initial value added associated with neg- 
ative TFP performance during the period. 
If, then, all the information could be gen- 
erated by a properly designed frequency dis- 
tribution of rates of TFP growth, could it not 
‘all be the result of chance alone—more spe- 
cifically, of errors of measurements? I really 
think not—my favorite quip on this is that 
‘‘white noise does not sing a tune.”’ That is, if 
we can rationalize what we see in terms of an 


analytical framework which embodies we 
established economic principles and sensit 
presumptions about underlying relationshi 
and facts, this is itself strong evidence agai: 
the white noise hypothesis. 
Nonetheless, we have to face the fact tt 
errors of observation of some magnitude ct 
tainly do exist, and we must recognize tl | 
they can cloud our perceptions and bias c 
results. What I am going to do here is consic 
frequency distributions of firms. TFP is me 
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FIGURE 6A. AVERAGE ANNUAL TFP GROWTH RATE: MEXICAN MANUFACTURING 1984—1994 
FREQUENCY DISTRIBUTION, 44 INDUSTRIAL BRANCHES 


ured in two ways—one using value added by 
t set of firms on the one hand, and the other 
ising ‘‘output’’ by those same firms, mea- 
ured through dividing value added by sepa- 
ate estimated firm-by-firm price (of value 
\dded) indexes p;. For these purposes we can 
‘onveniently think in terms of logarithms, so 
et: 


j = observed value added of firm; 

f = estimated firm-level price index; 

F= Yj — p; = estimated output; 

į =T; + e; [T; = true value added]; 

j= m; + uj [r; = true price index]; and 

j= T; —-1; [true output of firm). 

c, 1984-1994) Ye would like to have data on g; and its 
ariance 


i = ob — 20t + o}. 
embodies wel 

les and sensib we simply work with observed value added 

ing relationshijs our quantity variable, we get 

exidence againt 

| c3 = of + o2 (assuming o7; = 0). 

ace the fact th 

e magnitude cef we worked with the measured y,, we get 

st recognize thi 

ons and bias ol? = ot+o3+0324+ 02-20% 

> here is considé 

ms. TFP is mef (assuming u and é to be strictly random). 


My presumptions are as follows: 


(i) We can estimate value added quite ac- 
curately at the firm level. Hence the pre- 
sumption that o?2 is small. 

(ii) In most industries, there is considerable 
variety among the firms and their prod- 
ucts. Hence, except in cases of industries 
with very homogeneous products, we 
should not expect o2 to be small. Hence, 
I expect 02 > o?. 

(iii) Finally, we have the presumption that, at 
least at the level of firms within an in- 
dustry, o;7 < 0. We know that firms 
choose to operate in regions of the de- 
mand curve where they consider the elas- 
ticity facing them to be greater than one. 
But also, in an analysis of the growth 
process, one would expect the big gains 


in value added-to accrue to those firms. - 


in an industry which were passing along 
to consumers some of the fruits of cur- 
rent or past real cost reductions. 


These three presumptions lead me to the con- 


clusion that o3 is likely to understate the true 
variance of output oj 4 (because —2c; > 0 and 
ci > c4), and that g3 is likely to overstate 
a$ (only the covariance terms with é and ù, 
which were assumed to be zero, could make it 
otherwise). And since Torre worked with real 
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FıGURE 6B. MAXIMUM AVERAGE ANNUAL TFP GROWTH RATE (MAX. ORDINATE OF TFP PROFILE): 
MEXICAN MANUFACTURING, 1984—1994 
FREQUENCY DISTRIBUTION, 44 INDUSTRIAL BRANCHES 


value added as his quantity variable, this sug- 
gests that, if anything, the substitution of the 
“true quantity variable’’ q for observed value 
added v would have given results with 
greater dispersion of TFP, and consequently 
greater overshooting in his sunrise-sunset 
diagrams. 

The above demonstration should be taken as 
merely suggestive. It is not important to me 
that Torre’s results underestimate the variabil- 
ity of the different firms’ TFP experience. It is 
` only important that measurement error should 
not be the principle determinant of those re- 
sults. On this I feel very confident. In my view, 
it really is ‘‘a jungle out there,’ with winners 
and losers in every period—good as well as 
bad. 

As I have noted earlier,.we are only just be-_ 
ginning a systematic study of TFP among U.S. 
firms, so I can offer no display comparable to 
Torre’s. 

However, Robles (1997) did examine the 
experience of 12 firms in the U.S. oil industry. 
His results are summarized in Figure 7. But 


Robles tells basically the same story as Toompere 


* Three firms out of the 12 were more than sufthat 


ficient to generate the real cost reduction eteco 
perienced by the total group. Half (or smokes 
half) of the firms had negative TFP growth iof ti 
each period. And the cumulated amount of thibusi 
negative TFP growth was sizeable when metwhe 
sured against the total TFP performance of ththey 
industry. chal 
What I see in TFP performance is quite andithey 
ogous to what I see in the stock market pages cess 
the newspaper. There are winners: and Josef F 
every day, every month, and every year. j 
gains and losses come from all sorts of causetfere 
World price shocks can drive firms into nega ito ti 
TFP performance if the consequent output réup 
ductions are greater than the reductions of itApp 
_puts. So, too, can cyclical or secular declines ithe, 
demand, including those caused by the succestsury 
ful actions of competitors. a 
When firms are under stress, they typicallfcen 
fight to stay alive. Maybe they fight for tofend 
long in some cases, in the sense that less &per 
society’s resources would be wasted if of 
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FIGURE 6C. PERCENTAGE WITH Positive TFP GROWTH: MEXICAN MANUFACTURING 1984-1994 


PROFILE): 
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~~ 
} story as Torīyere to quit earlier in response to a challenge 
¿p more than subat turns out to be deadly. But they do not 
Re reduction “lecognize the challenge as deadly, so they 
Half (or almoyeep struggling to survive. I believe this is part 
> TFP growth if the nature of entrepreneurs, CEOs, and 
d amount of thiusiness leaders in general. They would not be 
‘able when me¥here they are, doing what they are doing, if 
formance of they were ready to quit at the first sign of a 
eos hallenge. They are fighters by nature, and 
ace is quite anthey probably would not have achieved suc- 
: market pages fess if they were not. 
anes and lose Firms with negative TFP growil. may even 
every year. The innovators. New challenges come and dif- 
1 sorts of causeèrent firms think of different ways to respond 
rms into negat'» them. Some (like Intel and Microsoft) end 
guent output Np winners; others (Montgomery Ward and 
reductions of ilpple?) end up losing. But it may not be that 
ecular declines bey just waited passively and tried to fight to 
:d by the succesùrvive in the face of negative shocks. They 
_ Bay have had quite innovative ideas, with de- 
ss, they typicallent prior probabilities of success, but in the 
rey fight for tend success did not come. Thus, negative TFP 


ense that less erformance can, and I believe often does, 
2 wasted if thome cimnty fram “haclkine the urnana hares ”? 


FREQUENCY DISTRIBUTION OF PERCENTILES, 44 INDUSTRIAL BRANCHES 


To me, Joseph A. Schumpeter’s vision 
(1934) of ‘‘creative destruction’’ captures 
much of the story. What he is saying is, yes, 
it’s a jungle out there, but the processes of that 
jungle are at the core of the dynamics of a 
market-oriented economy. They are what got 
us to where we are, and they hold the best 
promise for further progress in the future.” 

In my opinion, Schumpeter saw through to 
the essence of the problem, but it is not wise 
for us to be fatalistic in accepting his vision. 
We cannot lose by making a major effort to 
understand the process of TFP improvement 
where it happens—at the level of the firm. 
This is all the more true because of the 


? The idea of creative destruction has come up in recent 
literature, in a context of formal modeling as distinct from 
this paper’s focus on growth accounting and the intuitive 
economic interpretation of its results (see Gene M. ` 
Grossman and Elhanan Helpman, 1991, 1994; Philippe 
Aghion and Peter Howitt, 1992). For an econometric study 


emphasizing the variability of performance among firms, 
caa Tananac RMaivracca and (Meilinhac (1002 ne INN DAN 
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pervasiveness of negative as well as positive 
TFP performance among the components of 
almost any aggregate. By learning more about 
this aspect of the aggregate picture, we may 
stumble upon ways to ‘‘accentuate the positive 
and eliminate the negative’’ parts of the TFP 
story. But that is too quixotic a goal to take as 
the point of focus right now. To me, the pres- 
ent task is simply to get hold of the huge mass 
of information that is available at the firm 
level and squeeze it hard enough to wring out 
as much understanding and as much insight as 
we can. 


IV. Some Observations on Methods 
and Research 


What I am about to say in this section is not 
meant to consist of direct implications of what 
has gone before. Instead, I think of the earlier 
parts of this paper as building a case for a cer- 
tain vision of the economy, and of how the 
forces of growth work within it. This vision in 
turn leads one to think in different ways not 


only about the growth process itself but aba 
how we, as economists, might best advan ¢ 

P ; { Cum. Su 
our study and understanding of it, and ho Real Co 
policies might be molded so as better to prt Reductic 
mote it. 


(a) It is always wise to study the soi 
nents of growth separately. The rate of i 
vestment, the rate of return on capital, the ra 
of growth of the labor force in numbers or} 
hours worked, the contribution of human ca 

ital or of the increment in average quality: 

labor, and the residual representing real co 
reduction—all these are sufficiently differen 
and potentially sufficiently disjoint from ea 

other, to merit their being treated separately. 
would give special emphasis to the followin 
three points.. -~ ~- eae 


(i) The worthwhileness of measuring t 
rate of return and emphasizing its role 
the growth process. 


(ii) The importance of focusing on inves} 
ment rather than saving in studying | 


f. } 





w- 


HARBERGER: A VISION OF THE GROWTH PROCESS 19 








Las Ay 


| 








i 
i 15 
= Jenn Cum 
k Rate of 
g TFP 
'Real Cost 
| Reduction Growth 
| i 
| .08862 
r 
z 
.05 
0 
0 5 1 
Percentile of Initial Value Added 
TENT: 
A. 1970-1981 
cess itself but abot 
a, 3 .08 
might best advantn, mn Sum Cum. 
* . le 
äng of it, and hopes Cost Rate of 
so as better to Pr'Reduction TFP 
Growth 
|. i .06 
to study the comps 
ely. The rate of i 
n on capital, the ra 
rce in numbers or 04 
ution of human caj 
n average quality 4 
»presenting real co 
sufficiently differes 024148 
ly disjoint from ead 
; treated separately: 
asis to the followi 
a 
a f 0 
ss of measuring tË 6 5 i 
mphasizing its role# Percentile of Initial Value Added 
` focusing on inves B. 1982-1994 
wing in studying Ficure 7. TFP GROWTH PROFILES FOR U.S. OIL Firms 
FH o oa D> <s ~ -on 


20 


process of growth. Saving is an interest- 
ing topic in its own right, but the more 
‘open economy”? is the situation being 
studied, the less saving has to do with 
investment. Saving takes on great im- 
portance in closed-economy models fo- 
cused on aggregate growth, in which 
case it is equal to investment. It gets to 

`` be almost meaningless’as one focuses on 
the growth of cities and regions, or on 
firms and industries. 

(iii) The importance of viewing the residual 
as an umbrella covering real cost reduc- 
tions of all kinds, and of recognizing that 
we are closer to home thinking that RCR 
takes 1001 forms than that it can be well 
represented by one or two or three 
aggregate-style variables. 

(b) In principle, the accumulation of hu- 
man capital by the labor force should be rep- 
resenied in the labor contribution of the 
growth equation, or in a bifurcation of this 
contribution into one due to raw labor, the 
other to human capital. It is in a term like 
x,;w, AL, that one captures the shifting skill 
composition of the labor force. In particular, 
we capture here the higher wages that are the 
fruits of investment in education and training, 
which are the benefits that the workers them- 
selves perceive. These should be kept sepa- 
rate from any externalities education might 
have. 

It is important to try to keep this internalized 
part of the story out of the residual, so that we 
can straightforwardly interpret the residual as 
real cost reduction. 


(c) To study externalities due to education, 
training, or human capital, we should not be 
content with broad generalizations such as 
“TFP growth is higher in entities with lots of 
human capital per worker.’’ We should try to 
figure out how this externality works. Is it 
higher for firms with high incidence of human 
capital? Is it higher for industries or sectors? 
Or are human capital externalities more spatial 
in nature, making more efficient the economic 


_life of the cities, provinces, states, or nations 


which have high concentrations of human cap- 
ital? And if this is a fruitful trail to pursue, at 
what type and size of geographical units do 
these externalities typically work? 
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(d) The same goes for economies of scale? ve a br 
We should not be satisfied with vague attri$ g my know 
butions of economies of scale, say, at the levef urface ir 

P : surrace i 
of the national economy. Instead, we should 


f more fert 
pursue the matter. If the economies of scale} m 


are national, through what channels do they; cess of ec 
work, and what evidence do we have to look (g) I. 
at to see them in operation? In particular, what trying to 

is their connection to real cost reductions ad ERON 
where they really happen—i.e., at the level off sions. Cr 
the firm? Economies of scale at the levels of hopeless! 
the firm and the industry are easier to visual} growth pi 
ize. Here, too, however, the task is to check mach to ¢ 
them out—to see if the real cost reductions of much of 

firms are linked to the initial sizes of those} ween Su 
firms themselves, or of the industries in which} ladesh ai 
they operate, and of the direction (up or down)t Cenada. 1 
in which output is moving. I sible to ] 
countries 


that chara 
growth e) 


(e) Perhaps most important of all, wef 
should really try to take full advantage of ev. 
idence at the firm level. I think particularly off o¢ countr: 
identifying considerable numbers of outstand-¥ 1 describ 
ing cases of TFP improvements and TFP de} one hand 
cline, and studying them one by one to try tof we can re 
ferret out the sources of their big real cost ret of the na 
ductions and real cost increases. You can bei resorting 
pretty sure, if there have been big real cosl that seer 
reductions in a firm, some people in that firms very disp 
have a pretty good idea of where those reduc? posed to 
tions came from and how they were accom-} Benglade 
plished. By capturing this grassroots evidence, as others 


we can put some added discipline into our mF to take th 


minations about the nature of TFP at the agf werd moc 


gregate level. We must follow up on the son} My vie 
of work pioneered by Jacob Schmooklefi\ siens is s 
(1966) and Edwin Mansfield (1995). In ger}; that they 
eral, our aggregate story should be compatiblef (rate of i 
with, and comfortably contain, what we see atf? cost reduc 
the grassroots level. In particular our overall Hl overall gr 
picture of TFP improvement should comfort! tinction t 


ably accept the overwhelming evidence of the} presenting 


‘‘mushrooms’’ rather than “‘yeast’’ nature offf “explaini 

the process. grow fast 

; and (ii) 

(f) Special urgency applies to the study di “stylized 

` declining total factor productivity at both thet observe (1 

firm and the industry levels. The pervasiveness}, ing more 
of declining TFP is perhaps the most profound f; 
conclusion to emerge from the empirical links} 
that I have reported here. As a profession, wet 

obviously have been aware of its existence alt} In appr 


the industry level for virtually all studies thal}! of Policy 


ae 
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1omies of scale ive a breakdown by industry reveal it. Yet to 
ith vague attr.” y knowledge, we have barely scratched the 
say, at the leve face in studying it. I find it hard to think of 
ead, we Peat inore fertile soil for future research on the pro- 
nomies of Scali ess of economic growth. 
aannels do ther 
we have to lodh (9) I do not think that we gain much by 
| particular, whe; ing to express the relation between policies 
cost reduction: jd economic growth by a series of regres- 
e., at the level isons Cross-country growth regressions seem 
ra the levels ‘hopelessly naive to longtime observers of the 
easier to visuals owth process like myself. To us, there is too 
kask is to checly uch to question in regression lines that draw 
ost reductions ouch of their slope from the differences be- 
sizes of thosiween Sudan and Switzerland, between Bang- 
dustries in whichadesh and Brazil, or between Ceylon and 
ion (up or doWn'Canada. In contrast, it seems much more sen- 
sible to look at episodes within individual 
countries and to search for common elements 
ttant of all, What characterize the passage from bad to good 
advantage of ¢¥srowth experiences within each of the number 
nk particularly cof countries, and for those elements that seem 
ibers of outstandio describe the good growth experiences on the 
nts and TEP dene hand and the bad ones on the other. I think 
by one to try twe can reach in this way a good appreciation 
r big real cost réof the nature of the growth process, without 
ases. You can besorting to the straitjacket of regression lines 
een big real Costhat seem to draw from comparisons among 
eople in that firtyery disparate countries, lessons that are sup- 
vhere those reduchosed to be meaningful for countries like 
ey were accomBangladesh, Ceylon, and the Sudan—as well 
assroots evidenctgs others at different levels—as each strives 
ipline into our To take the next upward step in the climb to- 
of TFP at the agward modernization. 
ow up on the Sol? My view of cross-country growth regres- 
cob Schmooklesions is somewhat less negative to the extent 
Id (1995). In getthat they focus on the components of growth 
auld be compatiblirate of investment, rate of return, and real 
ain, what we see ost reduction in particular) rather than on the 
‘ticular our overaljveral] growth rate. There is also a subtle dis- 
nt should comfortinction to be drawn between two ways of 
ing evidence of théresenting cross-country regressions—(i) as 
yeast nature Oexplaining’’? why and how some countries 
ow faster than others (not recommended), 
S d (ii) as simply summarizing a series of 
plies to the study Ostylized facts” describing the experience we 
luctivity at both mserve (far preferable, and not just for its be- 
"The pervasivenesg more modest in its claims). 
s the most profouné! 
1 the empirical link’ 
As a profession, wg 
e of its existence 4 In approaching the question of the influence 
ually all studies thaf policy on real cost reduction in particular, 


V. Some Policy Implications 
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and, to a degree even on economic growth in 
general, I believe that the key words are ‘‘ob- 
structing’ and ‘‘enabling.’’ We. know from 
sad experience how easy it is for governments 
to adopt policies that get in the way of eco: ~ 
nomic growth and even turn it negative. We 
know, too, that there is no ‘‘silver bullet,” no 
single magic key that by itself opens the door 
to a paradise of prosperity and growth. 
Broadly speaking, the easiest starting point for 
a successful growth experience is a once- 
prosperous economy that has suffered from 
bad policies. Releasing that economy from its 
trammels, correcting an accumulation of past 
mistakes, can sometimes set in motion a pro- 
longed episode of astounding growth. A shift 
from policies that obstruct to policies that en- 
able growth seems to lie at the heart of growth 
‘‘miracles” like those of Taiwan, Spain, Ko- 
rea, Brazil, Indonesia, Malaysia, and China 
(among others). 

The springboard for the following listing of 
policy implications is the interpretation of the 
growth residual as representing real cost re- 
duction and the ready acceptance that in the 
real world RCR comes in 1001 different 
‘forms. 


(a) The first key observation is that people 
must perceive real costs in order to reduce 
them. Hence, policies that impede the accurate 
perception of real costs are ipso facto inimical 
to growth. Inflation is the most obvious, prob- 
ably the most pervasive, and almost certainly 
the most noxious of such policies. If I have any 
expertise based on experience in economics, it 
has to be in the first-hand observation of pro- 
cesses of serious inflation. So I ask you to 
take my word for it: the most serious cost of 
inflation is not a triangle or a trapezoid under 
the demand curve for real cash balances, nor 
is it the inflation tax. The most serious cost 
of inflation is the blurring of economic 
agents’ perceptions of relative prices. This 
happens because individual prices adjust in 
different ways and at very different rates. A 
high product price and a low input cost nor- 
mally is an invitation clamoring for new in- 
vestments to be made. This is not so during a 
serious inflation, when such a signal can eas- 
ily turn out to be “‘here today, gone next 
month’’ as both product and input prices con- 
tinue on their separate paths of adjustment to 
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the ongoing inflation. Without exception, in 
my own observations, the higher the rate of 
inflation, the worse is its effect in blurring 
agents’ perceptions of relative prices. In an 
inflation at, say, 20 percent to 50 percent per 
year, people see prices as in a morning haze; 
in one of 20 percent to 50 percent per month, ` 
they see them as in a London fog. Many em- 


pirical studies exist showing that sencas in~“ ~ 


flations are seriously inimical to growth. (See 
William Easterly, 1996.) The clouding of 
perceptions of relative prices is an important 
reason why—for it gets in the way of suc- 
cessful real cost reductions at the level of the 
individual firm. 

Inflation also inhibits growth in other, per- 
haps more obvious ways: 


(i) by diverting energies from more produc- 
tive activities to the search for mecha- 
nisms of inflation protection; 

(ii) by reducing (often very drastically) peo- 
ple’s real monetary balances, thus im- 
pacting negatively on the real amount of 
credit the banking system provides to the 
productive sector; and 
somewhat related to both (a) and (b), 
by causing people (both ‘‘here’’ and 
abroad) to invest abroad some of the 
funds they would otherwise have in- 
vested ‘‘here,’’ or (what is very close 
to the same.thing) by accumulating 
hoards of hard currencies as an inflation 
hedge. 


(iii) 


(b) A second policy implication is, in the 
words of my friend and longtime collabora- 
tor Ernesto Fontaine, avoid ‘‘prices that lie’’ 
(precios mentirosos). Talking about infla- 
tion, we focus on the blurring of the signals 
that the price system gives; here we focus on 
its giving wrong signals due to distortions 
that have been introduced, usually as a direct 
consequence of government policies. No 
good can or did come, in terms of economic 
efficiency, from tariffs of 50 percent and 100 
percent and more, giving effective protec- 
tion often of 200 percent and 300 percent 
and more. Nor can growth be fostered by 
heavy-handed price controls and interven- 
tions in credit markets. 

I am not being a religious purist here—just 
as big distortions have big costs, small distor- 
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tions typically have small costs, and all econ, jastian 
omies are distorted to some degree. Thy eper ( 
message here is that economies have to pa} osts beyo 
the price for the level of distortions thet or real ca: 
choose to have, and that one of the importan have bi 
components of that price is that distortion, jangle-tr. 
create situations where what is truly a saving. osts of tr 
of private costs is not a genuine saving of costi ite natu 
from the puis. f rzw of the economy as ¢ eps gre 
whole. mune o 
seccnd, th 
(c) Just as bad, and often even worse tha, ease a 
direct distortions, are the excess costs impose, riers € 
on an economy by ill-conceived regulation forced to 
and bureaucratic hurdles. Hernando DeSot economy 
(1989) has made the exposure of these tral rade libe 
mels in Peru into what has become virtuall sost redu 
his life’s work. Clear rules of the game att netns toe 
an essential and integral part of a well 
functioning market economy, but all too easil} (e) T 
these get supplemented by others that malt among bi 
investment, production, marketing, sales, net omies mi 
product development, etc., more costly. Labe real cost 
laws have been particularly troublesome, ofte” heen del: 
adding artificially to the cost of labor and giv! is, I belie 
ing firms a strong incentive to avoid hirin state-ow) 
new workers, simply because of the high cost of constr 
associated with any later dismissal of them. | rec} cost 
But there are loads of other items —the neel sense an 
for approvals, sometimes a dozen or mort! sense. T 
before undertaking some investment or son? salaries 
new venture; regulations that one way OF al? onerous 
other impede new entry, so as to protect stron lower-sk 
vested interests (small retailers being pro of the fi 
tected against supermarkets in many cout producti 
tries); and the complexity of tax codes an cult to fi 
their enforcement, which imposes large co: {perhaps 
pliance costs on business firms and een 





. viduals. Somehow, countries interested ify of them 
promoting growth should find ways of parin are we: 
their regulatory frameworks down to thoSvalone,”’ 
rules and requirements that are really justifitrouble, 
able in terms of their costs and benefits to thor the g; 


economy and society at large. ‘the sear: 








(d) Although international trade disto jæcomp 
tions (tariffs, quotas, licenses, prohibition fies } 
etc.) might ‘be subsumed under points-(b) ame 
(c), their importance merits a separate heat 


tionism that sometimes bordered on autarchy§*arrieq 
has been one of the main hallmarks of titare, of 
growth miracles of the past half-century [s4j*tation: 


3 






ts, and all eco 
e degree. Th 
ies have to p 


sebastian Edwards (1993) and Anne O. 
Krueger ( 1985, 1997)]. Just as inflation has 
costs beyond the area under the demand curve 
distortions they. real cash balances, so protectionism seems 
of the importati have burdens that go beyond the standard 
that distortion; »nole-trapezoid-rectangle measures of the 
is truly a Saving. osts o? trade distortions. There are at least two 
ne saving of cos quite natural explanations: first, that openness 
€ economy- as helps grease the wheels of the international 
transfer of more modern technologies, and 
econd, that firms that may once have relaxed 
B even worse tha, ease and comfort behind high protective 
ess Costs IMPOStp, riers end up having to sink or swim once 
eived regulationgy;ced to compete in a much more open- 
ernando DeSotyonomy setting. Under either explanation, 
ure of these tramýade liberalization opens up new paths of real 
become virtualligst reduction, thus providing additional im- 
of the game aetus to economic growth. 
part of a well 
y, but all too easi} (e) The recent wave of privatizations 
others that maljmong both developed and developing econ- 
keting, sales, neģmies may have important effects in enabling 
more costly. Labtjeai cest reductions that otherwise might have 
troublesome, ofttheen delayed, or not have happened at all. It 
st of labor and gis, I believe, fair to say that in most countries 
ve to avoid hititiate-cwned enterprises operate under a series 
se of the high cospf constraints that seriously get in the way of 
ismissal of them. feal cost minimization in a comparative-static 
er items —the Netense and of real cost reduction in a dynamic 
dozen or MOKense. These constraints sometimes limit the 
nvestment or SOMalaries of executives, sometimes impose 
hat one way or aherous conditions on the firm as it employs 
as to protect sttOlgwer-skilled workers, often limit the capacity 
etailers being PY9f the firm to shut down inefficient lines of 
ets in many coUsfoduction, and almost always make it diffi- 
y of tax codes avuit tc fire workers, etc. To my mind, however, 
imposes large corkrhaps the worst attribute of state-owned en- 
»ss firms and indSrprises is the ethos that often evolves inside 
ntries interested them—an ethos where middle managers 
. find ways of parilfe well advised to ‘‘leave well enough 
orks down to thoone.”’ ‘‘not rock the boat,” and ‘‘not invite 
hat are really justifoubie.’’ This ethos flies in the face of a vision 
ts and benefits to Uf the growth process that gives a huge role to 
arge. e search for real cost reductions at the grass- 
___ pots level, and that recognizes the tumult that 
ational trade distotcompanies ‘‘creative destruction’’ in all its 
enses, prohibitiotims, | thus must applaud the contemporary 
under points (b) and toward privatization. If I harbor any 
erits a separate heatlalms in this connection, they concern the 
ness (from a prote$gree to which many privatizations have been 
vordered on autarchiiried out in too much haste and with too little 
ain hallmarks of tite, often motivated by purely fiscal consid- 
past half-century [sations rather than by a general search for 
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economic efficiency. This may have led to gra- 
tuitous transfers of wealth in some instances 
and to the planting of newly private enterprises 
in soil that was not properly prepared (e.g., 
still lacking a sound regulatory framework for 
electricity rates, or intelligent rules promoting 
competitiveness in at least some aspects of 
telecommunications, etc.) 


(f) One cannot complete a list like this 
without mentioning something that most of us 
simply take for granted—a sound legal and 
institutional framework in which individuals 
are protected against arbitrary incursions on 
their property and other economic rights. This 
very basic point—recently much emphasized 
by Douglass C. North (1990), Robert J. 
Barrow and Xavier Sala-i-Martin (1994), 
Mancur Olson, Jr. (1996), and Barro (1997 ) — 
is at least potentially a vital element for a sus- 
tained process of successful economic growth. 
If it is true that spurts of growth have some- 
times occurred in the absence of such a frame- 
work, it is also true that most cases of 
sustained growth over long periods of time 
have benefitted from a sound institutional and 
legal environment. 


(g) Somewhat related to the above is the 
element of political consensus concerning the 
broad outlines of economic policy. We have 
learned from experience that very admirable 
policy reforms can take place, yet end up hav- 
ing little effect. This can happen because a 
new government comes in and reverses the re- 
form. But it can also happen because people 
fear that a new government will come in and 
reverse the reforms later on. At the moment, 
the Chilean economy is one of the jewels of 
economic growth (and general economic suc- 
cess) in Latin America. Many people point to 
the thoroughness and pervasiveness of Chile’s 
economic reforms over the last two decades or 
so. But not so many point to the fact that the 
reform package has remained essentially intact 
through several changes of ministers, and even 
more important, through two presidential elec- 
tions in which the winners came from the op- 
posite side of the political fence from the 
government that initiated the reforms. The 
confidence in the economic order of things in- 
stilled by this sequential endorsement of the 
basic framework of economic policy has to be 
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one of the important reasons for Chile’s con- 
tinued, very impressive economic perfor- 
mance. And it is important, also, in the context 
of this paper. Living in a world in which real 
cost reductions are a key dynamic force pro- 
ducing economic growth, we must look to the 
motivations and preoccupations of those who 
také the critical decisions at the level of the 
firm. For these decisions, it is not only impor- 
tant that the policy framework be good now; 
the expectation that it will stay good in the 
future is also important. Otherwise, invest- 
ments will tend to be limited to those with 
short horizons and payment periods, and much 
soil, fertile with longer-term economic oppor- 
tunities, will go unplowed. 


VI. A Vision of the Growth Process 


Let me now try to summarize my own vi- 
sion of the growth process—the major ele- 
ments of which have been presented here. In 
the first place we have the five standard pillars 
of growth—the rate of increase in the labor 
force, the rate of increase in the stock of hu- 
man capital, the increase in the capital stock 
(net investment as a fraction of value added), 
the rate of return which that investment will 
yield (or can be expected to yield) and, last 
but not least, real cost reductions stemming 
from 1001 different sources. 

Commenting on these in turn, I would note 
that increases in the labor force have taken on 
new meaning in many countries as labor force 
participation rates (particularly those of 
women) have increased. Whereas with a con- 
stant participation rate, the growth rate of the 
labor force is just a proxy for the growth rate 
of population, important increases in labor 
force participation can lead, just by them- 
selves, to significant increases in measured 
real income per head. 

Concerning increases in the stock of human 
capital, my conviction is that most of their 
contribution to growth is, on the whole, well 
measured by market wages, as in.the expres- 


sion 2,w,AL;. This’does not deny the exis- - 


tence of externalities due to an increased 
human capital stock; it simply judges their in- 
fluence on the growth rate to be modest in 
comparison with the effects of education, 
training, learning-by-doing, etc., that can be 
(and typically are) internalized by those who 
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receive them. We therefore look for the effec) - 
of human capital accumulation mainly in th 
term Z;w;AL;, and only (via externalities) at 
one of many elements underlying the grow} 
residual R’. l 
The rate of investment is a veteran on th; 
stage of growth analysis. What I would emf. 
phasize here is the.importance of Iaintainin} 
a clear separation between the rate of inves} ` 
ment and the rate of saving. Models (lik 
those of the representative consumer ) jt} 
which saving and investment are alwad 
equal are not much use even for analysis 
the national level in our modern, interde 
pendent world. They are even less useful # 
one goes down to smaller geographica 
regions, and simply cease to make sense a 
one studies the growth process at the level 
of the industry and the firm. 
The rate of return to investment has iy 
many ways been the orphan of our growls 
analysis, having been masked from view i 
our typical representation of capital’s contri 
bution to the growth rate as s,(AK/K). Hert 
the rate of return (p + 6) is totally hidde: 
from view. I deeply urge that more of us ge 
into. the habit of representing this same oa 
as (p + 6)(AK/y). I want to see more atte 









a ese 


‘tion paid to the rate of return because it olay 


such a central role in the motivation of el 
nomic agents, and also because changes inf 
are such an important element in understané, 
ing and explaining the growth residual, Rf 












This table is adapted from Harald Beyer 
(1996) work. He carried out an analysis 6 
the growth experiences of 32 countries a 
ing from Sri Lanka to the United States o 
thei income scale, and from Iceland to Austr 
lia on the geographic scale. In Table 3 

present results for his ten countries with y 
highest.and for his ten countries with the of 


est GDP growth rates from 1971-1991. In E This is all 


second column the calculated average annu Calculatio 







rate of return is shown. In the third colum) p 2m op: 
we have capital’s contribution to the grows ne Ap 
- rate [= (p + 6)(AK/y)], and in the final e vith it pr 
umn the estimated average annual rate of fro t 
growth, all over the same time period. a i Ay 
Table 3 shows an unequivocal tendency fig 
fast-growing countries to be experiencing hig, 
rates of return as well as high capital con®” >The sta 
butions and high rates of TFP improvemesé: hs (ps 
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TABLE 3—GROWTH RATES, RATES OF RETURN, AND RATES OF TFP IMPROVEMENT 


(SELECTED FROM A SAMPLE OF 32 COUNTRIES, 1971—1991) 








Ten fastest-  ___ _GDP 
growing countries growth rate 
is a veteran on (¢, Taiwan 8.83 
What I would eg. Korea 8.47 
ance of maintainig. Thailand 7.65 
the rate of inv : Hong Koig 791 
tive consumer) $ Bevesice a8 
tment are alway Cyprus 5.12 
even for analysisg Zimbabwe 4.62 
r modern, aaa Colombia 4.43 
even less use ul y iena wae 
ller geographic: 
e to make senses Ireland 4.12 
rocess at the lev 
i Median 5.35 
investment has ý Mean 6.10 
han of our grow Ten slowest- GDP 
sked from view} growing countries growth rate 
of capital’s cont Austria 2.87 
as sı( AKIK) -Hë France 2.80 
) is totally hidd a 
that more of us # . ermany 2.60 
ting this same ted Belgium 2.56 
t to see more ae Netherlands 2.52 
um Re Tp United States 2.52 
motivation of e¢ , 
cause changes it! South Arica 2.16 
ment in understa Denmark 2.15 
rowth residual, E United Kingdom 2.12 
hy I feel this w Sweden 1.84 
om Harald Beyet 
out an analysis? Median 2.52 
f 32 countries ra Mean jAi 


Rate of. Capital cont. TFP 
retum to growth rate growth rate 
15.0 3.81 3.68 
13.2 4.30 2.38 
12.5 3.68 2.96 
20.0 3.56 2.28 
14.0 2.70 0.36 
10.6 2.99 1.92 
13.6 2.42 0.97 
11.3 1.99 0.74 

9.4 1.95 1.77 
6.7 1.70 0.36 
12.85 2.84 1.83 
12.63 2.91 1.74 
Rate of Capital cont. TFP 
return to growth rate growth rate 
5.1 © 143 1.29 
6.1 1.21 0.99 
6.3 0.97 1.29 
6.8 1.06 1.60 
7.0 1.12 0.83 
9.1 1.20 0.23 
7.5 1.58 —0.97 
7.5 1.01 0.82 
9.6 0.95 0.22 
43 0.66 0.24 
6.90 1.09 0.825 
6.93 1.09 0.661 


Source: Beyer (1996), Tables HI.1.1 through HE.1.32; also Appendix I for rates of return. 










ated average ann culation of TFP a higher level of the rate of 
Tn the third colum operates to reduce the calculated TFP 
ution to the grow» 4e is a positive component of R’ and 
„and in the final uld presumably be positively correlated 


Annual rate of Te it,* but R’ is found by subtracting p AK 
time period. {® Ay; hence, in a sense, the level of p 

uivocal tendency 

be experiencing 


À 


s high capital CON! The standard expression for the residual, R = pay — 
f TFP improvemé ~ (5 + 6)dK has a ‘‘dual’’ representation, which is: 


19711991. ïn ie is all the more interesting because in the - 


should presumably~be negatively correlated 
with R’). What we are seeing here, in my 
opinion, is a genuine syndrome in which all 
sorts of good things go together. Strong real 


R = Ldw + Kd(p + 5) — ydp. This form simply says 
that the fruits of real cost reduction have to go some- 
where—either to workers (Ldw) or to owners of capital 
[Kd(p + 5)] or, in the form of lower prices to the activ- 
ity’s customers (—ydp). 
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cost reductions and high rates of return create 
attractive investment opportunities which, 
when acted upon, bring about a high capital 
contribution to growth. It should be no surprise 
that under such circumstances the GDP growth 
rate itself tends to be high. It should likewise 
be no surprise that the opposite syndrome— 
with weak real cost reductions and low rates 
of return producing fewer interesting invest- 
ment opportunities—should end up being as- 
sociated with a low capital contribution and a 
low GDP growth rate. 

Finally, we come to the residual R’ itself. 
To me, the biggest message here is to recog- 
nize the multiplicity of sources from which it 
can (and I believe does) come, and the addi- 
tivity that nevertheless is its attribute. I think 
that the term real cost reduction very neatly 
captures both these aspects in a way that ren- 
ders it preferable to terms like TFP improve- 
ment and technical advance—preferable not 
in the sense of a mechanical definition (for 
which all three are equally good), but in the 
sense of better conveying the underlying na- 
ture of the process to one’s listeners or readers. 

The next step is to recognize that of the five 
main pillars, at least three (the rate of invest- 
ment, the rate of return, and real cost re- 
duction) are key foci of decision-making 
processes at the level of the firm. I cannot es- 
cape the conclusion that the great bulk of the 
action associated with the growth process 
takes place at the level of the firm. Hence, I 
feel we should focus much more study than 
we have in the past on what happens at this 
level. And when we are not working at the firm 
level, we should pay a lot of attention to what 
happens at lesser levels of disaggregation like 
industries and industrial branches. 

Kev insights flow from taking this kind of 
focus. Few economists are aware of the per- 
vasiveness with which sunrise-sunset dia- 
grams are characterized by overshooting, or of 
the importance that firms or industries with 
real cost increases (i.e., reductions in TFP) 
play in determining the aggregate rates of real 
-cost.reduction that we see in such diagrams. 
Here we have only scratched the surface in 
digesting the evidence. But I find impressive 
the degree to which the data of Table 3 seem 
to point to a growth syndrome in which high 
rates of return, high rates of investment, high 
rates of real cost reduction, and high rates of 
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output growth all go together. I see in this mf if The role « 
sult the likelihood that real cost reductions ay? owth proce 
the big driving force, generating high rates ating the ci: 
return and calling forth high rates of inveg quickly and | 
ment and high output growth. This interpre for real cost 
tation is compatible with many exercises thy emments car 
I have performed over the years in which enhanced co: 
have tried to contrast high-growth with loy. rationalizing 
growth’ experiences. In such exercises, as j! controls, the... 
Table 3, the difference in rates of real costy pace of inve 
duction has typically been a major “source’t um. In this | 
of the difference in growth rates. ' good policy : 
Also impressive in the analysis of t ie cannot 
Jorgenson data is the degree to which the vary type analysi 
ing experiences of U.S. manufacturing in diff peing of vite 
ferent decades derives from different degres 
of bad experience (real cost increases) rath! 











op 


Schumpeter’s ‘* 
more steady (for these decades in U.S. mh 
ufacturing ) than the destructive part, whettin 
our (or at least my) appetite to look deepat r 
inquiring into why this was so. ; most develo 
The Mexican data at the firm level wer production 1 
somewhat more recalcitrant than the Jorgenso in relation tc 
data by industrial branch, but they nonetheley omies. One 
give us a clear picture of lots of winners aÑ integrally re 
lots of losers, with the losers being strongl§ dia, and ma 
characterized by falling real value added aní equivalent i 
or by falling real rates of return.* | The combi: 
‘and much 
, Wages) wor 
' dian firm t 
* Of Torre’s observations with negative TFP grow U. S. firm o: 
over three-fourths had negative growth in real val ical Indian 
added, and over one-half had falling real rates of re industry, Tl 
Less than 15 percent showed increases both in real val ris o 
added and in their real rate of return. Many of Tom. perating 
“anomalies” of negative TFP improvement together wi ton, If, as 
pcsitive real output growth stem from very high rates ciency betv 
iq 












return (p ) being imputed to the observed increases ( is typ: 
in the capital stock. quite ra ds 
This points to a problem that extends to all (or n Count p 

all) growth-accounting frameworks. Implicitly, they a untries a 
sign to new investment a marginal product based onh 

average observed gross rate of retum (J + 6) or averés; 
observed:-capital share s,. This makes little sense in cag: 

where the observed J is far above or far below the goles 

market rates. Firms earning 50 percent real return in 1 the rate c 
historical investments are very unlikely to be ‘‘requiri shi ents ca 
such a high expected yield on new investments. Simil le” rates ( 
firms going through periods of actual accounting lo investment 
may often still be investing, but it is absurd to think aam p th 
are expecting (or typically getting) negative returns $ industries b 
their new investments (AK). There are, I think, good mat 
sons for us to experiment with alternative ways of sel that wil 








The tole of policy in this vision of the 
wth process is an ‘‘enabling’’ one. By cre- 
ating high rates g the circumstances where firms can 
gh rates of invẹickly and accurately predict opportunities 

th. pals eal cost reduction and act on them, gov- 
any exercises ments can guide the economy toward an 
. years in whickpanced contribution of RCR to growth. By 
-growth with lofionalizing and/or eliminating barriers and 
ch exercises, asřptrols, they may also lead to an increased 
ates of real oe of investment and increased rates of re- 




















a major “‘sourcin, In this view, the connection between 
rates. d policy and growth is not mechanical, and 
e analysis of {ùs cannot easily be captured in regression- 
è to which the vape analysis, but that does not stop it from 
anufacturing in ding of vital importance. _ 
m different degre] want to give special weight to another role 
st increases ) rat, ‘growth-enabling”’ policy actions, which 
od experience (ty to do with how policies, the effects of 
the creative parthich might typically be considered to be 
destruction’’ wmparative static, can nonetheless turn out to 
cades in U.S. mifluence economic growth rates over ex- 
ctive part, whettinded periods of time. 
tite to look deepThis story begins with a recognition that 
as so. ost developing countries have typically used 
the firm level weoduction techniques that were “‘backward’’ 
t than the Jorgensrelation to those used by the advanced econ- 
but they nonethelities. One way to verify this is to imagine 
` lots of winners degrally replicating, say, a U.S. factory in In- 
osers being strona, and manning it there with Indian workers 
eal value added aquivalent in skill to their U.S. counterparts. 
F return.’ ìe combination of lower construction costs 
id much lower operating costs (mainly 
Ages) would permit this hypothetical new In- 
an firm to undercut the prices of both the 
ith negative TFP gro. firm of which it was a copy and the typ- 
ive growth in real val Indian firm currently active in the same 
falling real rates of reinssry, This says that the typical Indian firm 
ncreases both in real va i re Seca : 
f return. Many of TowPerating on an inferior production func- 
improvement together va. If, as I believe, the difference in effi- 
ie from very high ratefncy between U.S. and developing-country 
e observed increases (Ans ig typically large, there is much room for 
ice malities nefte rapid improvements in the developing 
‚works. Implicitly, they ptries as they learn how to ‘‘adopt and 
eginal product based ont > >>= ov ; Sie ii 
f return (p + 4) or avi 
s makes little sense in ¢ 
bove or far below the gẹ f 
O percent real return in the rate of return (p) imputed to new investments. 
ygunlikely to be ‘“‘requiti ments can be developed for using arbitrary but ‘‘sen- 
new investments. Simik® rates (like 10, 15, or 20 percent), or for applying 
of actual accounting loBvestments by individual firms and/or industries rates 
out it is absurd to think {turn J that are obtained not from those specific firms 
zetting) negative retum:?dustries but from broad sectoral (e.g., manufacturing) 
There are, I think, good *tnomywide averages. I am pursuing these avenues in 
ch alternative ways of sef% that will be reported at a later time. 
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adapt’ already-known techniques from the 
advanced countries. 

I would assume that the incentives for ‘‘con- 
vergence’’ are always present, but that they 
have typically run into barriers and trammels 
of many kinds in the poor countries of ihe ` 
world. Reducing the barriers and loosening the 
trammels permits the more rapid convergence 
to techniques that are closer to the frontier of 
knowledge. 

The way I see the influence of policy in 
growth, it is simply not true that implementing 
enabling policies typically permits a quantum 
jump from the old to the truly modern. It is 
more accurate to describe it as speeding up 
what will in any case be a very lengthy pro- 
cess. Personally, I like the analogy to a hy- 
draulic system in which a large vessel with a 
high water level and lots of water is connected 
to a much smaller and narrower vessel with a 
much lower level of water. Physical laws dic- 
tate a tendency for the water levels to equalize 
in the end. But this can take a very long time 
if the tube connecting the two vessels is tiny, 
or is clogged by extraneous matter. 

The policies that we consider good for 
growth have the attribute, in this analogy, of 
removing the extraneous matter and/or enlarg- 


- ing the connecting tube. But even with the best 


modernizing policies, the tube remains small 
enough so that it takes many decades for a 
country to pass from poor to middle income or 
from middle income to rich. If somehow the 
hydraulic connection could be made so large 
as to bring about an almost instantaneous full 
adjustment of the water level, then we would 
say that good policies mainly represent level 
adjustments. But observing even the best of 
real-world growth experiences, I think we 
have to conclude that the adjustment is going 
to be extended over a lengthy period in any 
event, thus causing the big observable result 
of better policies to bé a higher growth rate 
over an extended period rather than a discrete 
jump to a totally different level ~~" “Y 

This way of looking at the world also leads 
to some observations on the current literature 
dealing with convergence. I have long been 
mystified by allusions to an ultimate conver- 
gence of growth rates among countries, or an 
ultimate convergence of levels of output per 
head. To me, the natural convergence is prod- 
uct by product, not country by country. And 
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among products there may be some where 
current techniques will never be further 
improved. Those products will have no real 
cost reductions or TFP improvements in the 
future, while others will enjoy huge advances 
of productivity. My guess is that unlucky 
countries (Bhutan, Nepal, Mongolia?) may 
. always lag way behind the pack, while luckier 
ones (Taiwan, Argentina, Brazil?) may one 
day hope to be among the world’s leaders. So 
convergence comes through as a general ten- 
dency, and quite surely a general possibility, 
for the production techniques used in making 
any given product to improve as enterprises 
using ‘‘backward’’ techniques learn of better 
ones, and even more important, learn about 
how to put them into practice. I do not believe 
that much more than this can be said of con- 


vergence as areal-world force. Wage rates for 


given types of labor tend toward a rough 
equality across regions in a country because 
of the ease of migration in response to per- 
ceived. wage differences. The forces at work 
internationally are both weaker and more 
complex, but the big message here is that 
the improvement of technique in any one in- 
dustry does nothing to improve the wages of 
labor in just that industry. After an improve- 
ment, the industry may end up hiring more or 


less labor, but will presumably choose the. 


amount of labor so as to bring marginal pro- 
ductivity into correspondence with market 
wages. So technological improvement has an 
effect on wages via supply and demand in the 
national labor market, not through any direct 
link from technical improvement to wages 
(for given skills, etc.) at the level of the in- 
dustry or the firm. 

As my final point, let me return to the 
thought that the justification for perfecting the 
functioning of the market system does not lie 
only in reducing the efficiency costs associated 
with each period’s operation of the economy. 
Perfecting a country’s economic policy does 
not only cause it to move from a path at 
around, say, 90 percent of its potential output 
to another. equal to 95 percent of potential, 
somehow given in advance. That gain would 
certainly be a worthwhile gain, and it would 
amply justify a lot of hard work involved in 
achieving it. But that gain is still fundamen- 
tally comparative static in nature. 





_policies emerged as part of a consensus of sg, 








if people have 


a sense that the perfecting of economic p eri sented pol 
cesses can also in nearly all cases be justits an pr: 
as greasing the wheels of the constant searn$ policies of t 


for new avenues of real cost reduction. To Me to ‘the CEOs 
extent that economic Teforms do so, they bfinat. take ai 
“quest for Te: 
ang an environr 
better ways ‘are found of doing ee not ju cess of ‘‘cr 
in so-called “‘production’’ but also in sug wonders. 
mundane areas as merchandising, sales, ff 
nance, insurance, and many more. ] 

Some years ago, in a book that I edited calle, 
World Economic Growth (1984), I wrote 
essay called ‘‘Economic Policy and Econo: 
Growth,” in which I listed ‘‘13 lessons” 
I thought followed from the papers presen 
in the volume, recounting the growth ex 
ences of countries as disparate as Ghana $ practice in 
Taiwan, or Japan and Sweden. These lessons- jnto compo 
basically focused on thinking about policies. (a) To 3 
terms of their economic costs and benefits capital, one 
could easily be read as a reprise of the of dollars of 
comparative-static story. But they were m capital stoc 
meant as such—it is quite relevant that th ficient way 
appeared i in an essay concerned with world ea} (value add: 
nomic growth. The point was that these sensit? the GDP de 
outputs of 
rious economists, each an expert in his parti} omy add u 
ular country’s history, focusing attention on tk; the require 
process of economic growth. ! | measured 

A few years later, and as the theme offi way cash 
different concept, John Williamson (199(g dard, ex px 
ae the term, the ‘‘Washington conse. done at the 

* Williamson listed ten points, cove i ( Capital to ; 

ay very similar to my 13 lessons. He alg. 
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J aggregate 
produced a pithy summary that captures tj ¿ciation rat: 
essential thrust of both his and my listingg:the net in, 
‘Macroeconomic prudence, outward orient¥-Period in c 
tion, and domestic liberalization.’’ He, tog:.Contributic 
and the members of the Washington profey. is (pj; + 6 
sional establishment whose apparent conse, tigh rates 


sus led to Williamson’s list, were not j 


eae: 









thinking of comparative-static gains as the. °(b) To 
reached their conclusions about policy. zbor factor 
were thinking about ways to move economi% feakdow 
from slow growth, stagnation, and even #0 i). The 
some cases negative growth, to-a health: Where Wi 
prosperous flowering of economic progre and AL; 
Similar views were more recently affirmed} X Bory i, S 
Stanley Fischer (1993). f g ties of lal 
To me, the dynamics of real cost reducti: “Wäcult, ; 
are at the very least an important piece of whe: BTègates f 






nople have in mind when they list efficiency- 
of economic pryiented policies as essential ingredients of a 
J.cases be justifi rogram promoting economic growth. It is 








Nhat take away trammels that impede their 
an economy to quest for real cost reductions, and that create 
, new, cheaper, avin environment in which Schumpeter’ s pro- 


ing things, not Jles of ‘‘creative destruction’ can work its 


















È The vision of the growth process presented 
licy and Econo in this paper leads one almost inevitably to 
“13 lessons” ths some methodological twists—twists which, if 
e papers presentithey are not new, at least differ in significant 
the growth expeways from what I take to be the most common 
ate as Ghana a practice in breaking economic growth down 
en. These lessonsyjnto components. : 
ng about a (a) To measure the real rate of return to 
alae one must express the numerator (real 


1984), I wrote fe 





f a consensus of Outputs of all the subaggregates in the econ- 
expert in his part{Omy add up to the GDP, and one also satisfies 
ing attention onfithe requirement that the capital and return be 
jiteasured in the same units. This is also the 
as the theme o way cash flows would be deflated in a stan- 
illiamson (199@ard, ex post project evaluation. When this is 
ashington cons done at the aggregate level the contribution of 
en points, cove capital to growth is (p + 6) AK where p is an 
13 lessons. He #aggtegate rate of return to capital, ô the depre- 
that captures (fiation rate (including obsolescence), and AK 
is and my listine net increment to the capital stock in the 
ce, outward oriesPeriod in question. Ata subaggregate level, the 
alization.” He, t€0ntribution of capital to growth in activity j 
Washington prof. í (p; + 5;)AK;. At both levels, we find that 
at appaicit cons igh rates-cf return are an important compo- 
| list, were not jent of most successful growth episodes. 
-static gains as | (b) To capture the great diversity of the la- 
s about policy. Th&or factor, we would like to have a very fine 
b to move economreakdown of labor into categories (indexed 
Sation, and ever Y i). The labor contribution is then Ew AL, 
phere w; represents the real wage of category 
; the change in hours worked by cat- 
recently affirmedfS0ry i. Since the number of relevant catego- 
‘Bes of labor is huge, any such breakdown is 
$ ticult, and gets more difficult as one disag- 
portant piece of WR °&gates from economy to sector to branch to 
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industry and to firm. This Gordian knot can be 
cut by a simple assumption, similar to what is 
done in most. countries to convert residential 
construction to real terms. There, one builds a 
price index of a ‘‘standard house” př, and 
then obtains a quantum of construction C* by 
dividing total construction outlays by the price 
of the standard house. In the resulting aggre- 
gate, each individual residence (i) gets attrib- 
uted a quantum of housing equal to p;/p; . In 
this work I define a standard wage w*, which 
I assign to ‘‘standard labor’’ or ‘‘raw labor.”’ 
The excess of anybody’s actual wage over w* 
is attributed to human capital. The returns to 
natural ability, as well as to formal education, 
training, and experience belong there, under 
this interpretation. High returns due to a dis- 
torted wage structure are not appropriately 
attributable to human capital, but the meth- 
odology would nonetheless be correct in at- 
tributing to the affected labor a marginal 
productivity that is measured by the distorted 
high wage. 

The ‘‘labor contribution’’ as measured by 
w*AL* is equal to w*(2;(w;/w*) AL; + 
2; L; A(w;/w*). The second term will be zero 
if the structure of relative wages remains con- 
stant, or even if the weighted average premium 
does not change. Any changes in the weighted 
average premium will cause the calculated re- 
sidual to be different from those calculated by 
other methods. 

The ‘‘two-deflator method’’ is characterized 
by the use of a single numeraire-deflator (say, 
the GDP deflator), by the treatment of the 
quantum of output as value added divided by 
the numeraire-deflator, and by the use of a 
standard wage w* and a quantum. of labor L* 
equal to the wages bill divided by w*. This is 
the method used by .Beyer (1996), Robles 
(1997), and Torre (1997) in their work re- 
ported in this paper. . . 

It goes without saying that the two- deflator 
method is rough. But it is also tremendously 
robust and easily applied. I think of it as being 
really designed for use at the firm level, where 
very commonly we can get data in value 
added, on gross investment, and on the wages 
bill, but know nothing (from standard sources ) 
about the quantum of output or about the num- 
ber of total hours worked (or even the total 
number of employees used). This opening of 
wide new vistas, of huge new data sets, is what 
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I consider the strongest argument for the two- 
deflator approach. 

But there are other pluses as well. First, at 
the aggregate economy level, the two-deflator 
approach comes very close to the traditional 
appreach. In rate-of-growth terms, we have 
(R*/y) = 8y — Sk8k* — SL8L», Compared with 


(R/y) = 8y — -5:8r=7-StBnx Where g refers to, | 


growth rate, and y to GDP. K* differs from K 
in being built up. from gross investment de- 
flated by the GDP deflator, while K is built up 
from gross investment deflated by the invest- 
ment deflator of GDP. L* is in principle much 
more refined than N (number of workers), but 
its measurement can be influenced by a wid- 
ening or narrowing skill premium. (R*/y) 
likewise differs from Jorgenson’s residual 
(R'/y) = 8y — LS Bxj — 
use of different capital deflators for different 
categories of capital. (The implicit labor 
breakdown in L* is much finer even than 
Jorgenson’s, but Jorgenson does not have to 
contend with the possibilities of widening or 
narrowing skill premiums—at least not 
among the labor categories he works with, 
which are typically broken down by gender, 
age, education, occupation, industry, and em- 
ployment status.) 

The bottom line is that when Beyer com- 
pares his two-deflator results, at the national 
aggregate level, with those of others using dif- 
ferent methods, he finds on the whole only 
modest differences. [See Beyer (1996); 
Harberger (1998) .] 

When one uses the two-deflator method at 
the industry level, one often has the possibility 
to adjust the quantity variable so as to make it 
correspond with that of the more traditional 
approach. Thus, we may start by using dy;* (= 
p;dy; + y;dp;) as the > quantity variable, and cal- 
culate a residual R* using that concept. Then 
we may get an adjusted residual by taking 
RÝ — y,dp;. This is easy to do so long as we 
have decent data on dp,, the relative price of 
j, which we often do at the branch or industry 
level. When Jorgenson’s residuals are com- 


pared with the two-deflator residuals, with and_ 


without price adjustment, I find the differences 
without adjustment to be small enough to be 
quite acceptable. With adjustment, the degree 
of agreement between the two approaches is 
quite notable (85 percent of differences less 
than one percentage point of per annum 


Z;Sege; mainly in his. 
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growth). [See Robles (1997); Harberg eyer, Harald. 
Cross-Cour 


(1998).] 

When one gets down to the firm level, t paper, Unis 
two-deflator method is in its element. Rarel}.. Jes, 1996. 
do we have decent time series on the price iptgoskin, Micha 
dex of a firm’s output. Treating many firms}} economic g 
one industry, one might then give all of theg- stitution, r 








that point the distribution of adjusted TFP 
siduals among the firms would end up diffe! ries. Wash 
ing from the original two-deflator distributia! 1967. 
only by a constant. When expressed in percen peSoto, Herna 
age terms we would have (R¥/y¥) = g%4 ble revolut 
SuBR — Sugey for each firm j without adju Tauris, 19€ 
ment, while with adjustment we would hay}fasterly, Willi 
Rř/y; = the same expression minus gz, ti pansionar: 
rate of growth of the industry’s relative prig. tions.” E 
index (the same for all firms). . Forum, Af 
So in the great bulk of cases one ends wEdwards, Seb 
















with something quite like the two-deflatet alization a 
method when working at the individual fing tries.” Jo 
level. The consolation is that the residu September 
terms of individual firms, calculated for tį Fabricant, Sc 
whole economy, add up to a residual tem economic 
for the aggregate—in the sense that outpu Report of 
sum to GDP, the L;* sum to L* for the eco nomic R 
omy, the K; * to K* “for the economy, etc. Fe 1954. 
more details on methodology, see Harberg Fischer, Stan! 
(1998). |} Factors in 
Monetar 
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The provocatively titled report of the National Com- 

ion on Excellence in Education (1983), A Nation at 

| details the decline of performance of U.S. students 

e 1970°s. More recent data can be found in Daniel M. 

tz (1987). Modest gains in performance on standard- 

achievement tests, followed by a leveling off, well 

w peak scores of the early 1960’s, characterizes the 

1980’s and 1990's. 
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Competition Between Private and Public Schools, 
Vouchers, and Peer-Group Effects 


By DENNIS EPPLE AND RICHARD E. ROMANO * 


A theoretical and computational model with tax-financed, tuition-free public 
schools and competitive, tuition-financed private schools is developed. Students 
differ by ability and income. Achievement depends on own ability and on peers’ 
abilities. Equilibrium has a strict hierarchy of school qualities and two- 
dimensional student sorting with stratification by ability and income. In private 
schools, high-ability, low-income students receive tuition discounts, while low- 
ability, high-income students pay tuition premia. Tuition vouchers increase the 
relative size of the private sector and the extent of student sorting, and benefit . 
high-ability students relative to low-ability students. (JEL H42, 128) 


policy figured prominently in recent presi- 
dential elections. The debate has centered on 
issues of school choice, including voucher sys- 
tems (Karen De Witt, 1992). Typical voucher 
proposals provide students attending private 
schools a tax-financed, school-redeemable 
voucher of fixed amount toward (or possibly 
covering) tuition. Although a 1993 California 
referendum for vouchers was defeated, policy 
change at state and local levels abounds, as 
does change in the private educational sector. 
The state of Minnesota and school districts in 
30 states allow residents to choose the public 
school their children attend.” The city of Mil- 
waukee introduced a voucher system in the 
1989—1990 school year. A number of private 
school and private—public school initiatives 
are developing (see e.g., John F. Witte et al., 
1993; Steve Forbes, 1994; Steven Glazerman 
and Robert H. Meyer, 1994; Joe Nathan, 1994; 
Newsweek, 1994; Wall Street Journal, 1994; 
Steven Baker, 1995; Jay P. Green et al., 

1996). Educational reform emphasizing in- 
creased school comnetition with an increased _ 


? Public funding of nonsecular schools and consider- 
abie freedom of school choice has been practiced for years 
in England (Daphne Johnson, 1990) and much of Canada 
(Nick Kach and Kas Mazurek, 1986). These choice sys- 
tems support horizontal differentiation in schooling and 
safeguards exist to limit vertical (quality) differentiation. 
Our analysis is concerned primarily with the effects of a 
voucher system on vertical differentiation. 


a 
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` role of the private sector is at the forefront of 
the policy debate and recent policy initiatives. 

The modern economic case for vouchers 
and increased educational choice was made 
by Milton Friedman (1962). The academic 
educational and political-science professions 
have since considered the pros and cons of 
voucher systems and educational choice (John 
E. Coons and Stephen D. Sugarman, 1978; 
Myron Liberman, :1989; John Chubb and 
Terry Moe, 1990). Economic analysis of 
the interaction between public and private 
schools, and of related policy instruments like 
vouchers, is only beginning to emerge. This 
paper continues the study of the ‘‘market’’ for 
education by developing a model that focuses 
on the interaction between the public and pri- 
vate educational sectors and also examines the 
consequences of vouchers. We describe the 
equilibrium characteristics of the market for 
education with an open-enrollment public sec- 
tor and a competitive private sector. 

Our model embodies two key elements of 
the educational process. First, students differ 
in their abilities. Higher ability is assumed to 
increase a student’s educational achievement 
and that of peers in the school attended. Sec- 
ond, households differ in their incomes, with 
higher income increasing the demand for ed- 
ucational achievement. A student in our model 
is then characterized by an ability and a house- 
hold income, a draw from a continuous bi- 
variate distribution. A school’s quality is 
determined by the mean ability of the student 
body, reflecting the model’s peer-group effect. 
We characterize the equilibrium distribution of 
student types across public and private schools 
and examine the tuition structure of private 
schools, assuming that student types are veri- 
fiable. We develop a theoretical and compu- 
tational model in parallel, with the latter 
calibrated to existing estimates of parameter 
values. Equilibria are simulated for a range of 
voucher values. 

Key characteristics of an equilibrium are the 
following. A hierarchy of school qualities will 
be present, with the set of (homogeneous) 
public schools having the lowest-ability peer 
group and a strict ability-group ranking of pri- 
vate schools. The equilibrium student bodies 
of schools correspond to a partition of the 
ability—income-type space of students with 











stratification by income and, in many cas. The entry of 
stratification by ability. As Figure 1 from o¢‘yent more effic 
computational model illustrates, type spacef:-hools caused t 
then carved into diagonal slices with eg welfare (and act 
higher slice making up a private school’s somputational : 
dent body and with the bottom slice comprpystributional ef 
“ing the public sector. ter, the magni 
The normality of demand for a good depends on the « 
group leads relatively high-income students}peer ability and | 
cross subsidize the schooling of relativeforoduction func 
high-ability students, producing the latter p#eyidence to guic 
tition. Private schools attract high-abilitiuch complemer 
low-income students by offering them tuiti¢the premium to í 
discounts, sometimes fellowships. Even wi largest proporti 
free entry, schools price discriminate by ifthen accrue to | 
come against students who are not on (dents. For exam 
margin between switching schools. The eg of $10,000 and : 
librium differentiation of schools and econspercentile has a 
mies of scale in education preclude perfetcent of income 
competition for every type of student. Nevajdents of low i 
theless, this price discrimination does not difremain in the 
rupt the internalization of the peer-groydollar voucher 
externality by private schools. An equilibriug welfare losses 
without a public sector is Pareto efficient give bears emphasi: 
the equilibrium number of schools. Becaw lic and private 
free public schools do not price the peer-gro : tive providers 
externality, an equilibrium with public scho argue that priv 
is Pareto inefficient. stive and that 
In the computational model, we employ} voucher prog: 
Cobb-Douglas specification of utility and a effectiveness. 
ucational achievement:which incorporates ty concludes frc 
peer-group effect. The parameters are ca that competi! 
brated to U.S. data from various sources. We provement v 
„compute approximate equilibria for vouch, achievement 
values ranging from $0 to $4,200 per stude- ` 
($4,222 equals the expenditure per studenti 























public schools in 1988).* With no vouche- “Caroline M. 
the predicted percentage of students in Uf that private-scho 
public sector is 90 percent (the actual vali Tio Wil 
for the United States is 88 percent). As HF e niea 
voucher is increased, the size of and me'i attend colleg: 


ability in the public sector decrease. With findin 


$2,000 voucher, for example, the percen ay cas while be 
of students remaining in the public secti; of educa 


equals 70 percent, and the mean ability 48 Thom 
clines by 15.8 percent. 4 


special cases. This integer problem and our approximal§): 
approach are discussed later in the paper. 3 











land, in many cx The entry of private schools and conse- 
As Figure 1 from&.nt more efficient sorting of students across 
istrates, type Spag4-oo1s caused by vouchers increases average 
nal slices with e&yfare (and achievement ) only a little in our 
| private school’s &nputational model, while having larger 
vottom slice compétributional effects. As we discuss in detail 
r, the magnitude of the aggregate effect 
and for a good pends on the extent of complementarity of 
h-income student#er ability and own ability in the educational 
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Pareto efficient gitars emphasizing that our model takes pub- 
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ve and that the competitive effect of a 
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)? With no pares Caroline M. Hoxby (1994, 1996) provides evidence 
ge of students in itprivate-school competition increases public-school ef- 
cent (the actual yvattiveness. William N. Evans and Robert M. Schwab 
s 88 percent). As 995) find that Catholic private schools are more effec- 
hegi f and gm inducing students to complete high school and also 
the sıze of an Matend college. These studies take on the challenge of 
:ctor decrease. Witding instruments that predict well private-school atten- 
ample, the percentite while being independent of unobserved determi- 
z in the public sec ts of educational achievement. regan 
“1: ceming the quality of the instruments used. See 
1 the mean ability mas J. Kane (1996) for a discussion of Hoxby’s meth- 
logy. David N. Figlio and Joe A. Stone (1997) employ 
Uerent set of instruments than Evans and Schwab and 
that, at current input levels, religious private schools 
less effective than public schools in producing 
tivate schools in our m@*vement on standardized exams in math and science 
etitive equilibrium excel Ncnreligious private schools are more effective). See 
oblem and our approximate (1996) and Figlio and Stone (1997) for references 
‘in the paper. ther studies. 
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losses of the magnitude that emerge due to 
reduced peer quality in our computational 
model. Our analysis delineates the allocative 
effects of vouchers and demonstrates a poten- 
tial for significant redistribution. : 

A theoretical-economics literature on edu- 
cation is beginning to emerge. Charles A. M. 
de Bartolomé (1990) develops a two- 
neighborhood model of the provision of public 
educational inputs (quality) with two ability 
types and peer-group externalities. He shows 
that the voting/locational equilibrium is inef- 
ficient because the median voter does not in- 
ternalize the consequences of migration on 
peer groups in choosing the input level. No 
independent income variability characterizes - 
students in his model. Raquel Fernandez and 
Richard Rogerson (1996) introduce income 
differences in a two-neighborhood model of 
the provision of inputs but abstract from peer- 
group effects. They examine the effects of 
redistributive policies and direct controls on 
inputs. Neither model has a private sector. Our 
analysis is differentiated by its consideration 
of a private sector and its two-dimensional, 
continuous type space. In a normative analysis 
of student groupings in the presence of peer- 
group effects, Richard Arnott and John Rowse 
(1987) show how a social planner would max- 
imize the sum of achievements in allocating 
students of various abilities across classrooms. 
We analyze equilibrium outcomes, and most 
of our analysis is positive. 

Joseph E. Stiglitz (1974), Norman J. 
Ireland (1990), Ben Eden (1992), Charles F. 
Manski (1992), Michael Rothschild and 
Lawrence J. White (1995), Epple and 
Romano (1996), and Gerhard Glomm and B. 
Ravikumar (1998) consider the consequences 
of a private sector for education. Stiglitz, 
Glomm and Ravikumar, and Epple and 
Romano are concerned with the existence and 
properties of voting equilibria over tax- 


` finadticed, public-schooi expenditure in the 


presence of a private alternative. Ireland ana- 
lyzes the effects of vouchers on utilities and 
the quality of the public alternative, taking the 
tax rate as exogenous. Individuals differ only 
by income, and the private alternative can be 
purchased continuously in all these analyses. 
Hence, the private sector is relatively passive, 
and issues of financial aid and differences in 
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student ability across schools do not arise. Our 
model is distinguished by having differences 
in ability and related peer-group effects, and 
by providing an active role for private-sector 
schools. 

Eden (1992) analyzes vouchers in a purely 
private market system of provision of edu- 
cation having two ability types and peer- 
group effects. A voucher equal to the 
difference between the social and private ben- 
efit of education to each ability type is shown 
to induce socially optimal provision of edu- 
cation. Key differences in our analysis in- 
clude our consideration of the interaction 
between the public and private sectors, our 
exploration of the implications of continuous 
differences in ability and income, and our at- 
tention to positive issues. Manski (1992) 
pursues a computational analysis of vouchers 
that also considers peer-group effects among 
other aspects of education (especially var- 
ious objectives of public-school decision 
makers). Our models differ in a number of 
ways. Most importantly, we permit private 
schools to discriminate in their tuition poli- 
cies, with many consequences. Rothschild 
and White (1995) analyze a competitive 
model with consumers also inputs to produc- 
tion (a peer-group effect), using higher ed- 
ucation as their primary example. We share a 
concern for market pricing in the presence of 
an externality. Differences in our model, 
among others, are the presence of a public 
sector, a more detailed specification of peer 
effects and demand for education, and student 
variation in both ability and household in- 
come. Our attention to the implications for 
pricing, profitability, and school qualities of 
a peer-group effect deriving from student 
abilities, the allocation of students according 
to ability and household income and the re- 
lated distribution of educational benefits, and 
the effects of vouchers are not concerns in 
Rothschild and White. 

Private schools are cases of clubs with non- 
anonymous crowding due to the ability- 
dependent externality and schools’ power to 
price it. Suzanne Scotchmer (1994) provides 
an excellent synthesis of this literature. We 
follow this literature in our competitive spec- 
ification of private schools as further dis- 
cussed below. 






The next section presents the model. § e: 
tion II develops the theoretical results. Tia public 
computational results comprise Section Ify(y, — P» 


Concluding remarks follow. An Appendjajl positive 
contains some of the detail. ures the p: 


I. The Model 
ae SOPAS? school’s qt 
Household income is denoted y, and eafthe mean a 
household has a student of ability b. The joig work we a 
marginal distribution of ability and income}? variation ir 
the population is denoted f(b, y) and is af Uis alsc 
sumed to be continuous and positive on jf ‘‘single-cr 
support, S = (0, bmax] X (O, Vmax]. All student 
‘ attend a school since we assume that free pull 
lic schooling is preferred to no schooling. (1) 
household decision maker’s utility functi - 
U(-), is increasing in numeraire consumptid Hence, for 
and the educational achievement of the houst.indifferenc 
hold’s student, and it is -continuous and twig higher-incc 
differentiable in both arguments. Achiev ence curve 
ment, a = a(6, b), is a continuous and increas below. Th 
ing function of the student’s ability and th come elas 
mean ability of the student body in the schog quality tha 


attended, 6.° Let y, denote after-tax income an types.’ On 
for SCT is 


one having 



































* The influence of ability on own educational achiev 
ment is well documented and not controversial. En 
Hanushek (1986) provides an excellent survey. In the 
nomics literature, Anita A. Summers and Barbari $ =—————— 
Wolfe (1977) and Vernon Henderson et al. (1978) fis: Sorensen and 
significant peer-group effects. Evans et al. (1992) adj and Mark Be 
for selection bias in the formation of peer groups and s 5.1992). How 


that it eliminates the significance of the peer group ins : (Robert E, S 


be interpreted as suggesting that peer-group effects doi: } 
exist, but as demonstrating that scientific proof of tha% Comic grc 
effects is inadequate. Note, too, that their work supp: We belie 
the notion that peer-group variables enter the utility fwg, 
tion since a selection process does take place. The pip. 


chology literature on peer-group effects in education W; d for e 
contains some controversy. In their survey paper, Richt: siderable div: 
L. Moreland and John M. Levine (1992) conclude: ff Me elastici 
F Present, estin 

The fact that good students benefit from ability Ẹ iba elasti 
grouping, whereas poor students are harmed by it, Ẹ ay Shapirc 
suggests that the mean level of ability among class- {È Househo 
mates, as well as variability in their ability levels, $ an inve 
could be an important factor. The results from sev- K ef p mulat 
eral recent studies . . . support this notion. By ese moti 

Rime e cc 

This squares with our reading of the literature (Summ iy a zeroi 
and Wolfe, 1977; Henderson. et al., 1978; Chen-Lin KU}; » this in 
(D would be 


and James A. Kulik, 1982, 1984; Aage B. Sorensen, 19 
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y uition expenditure, the latter equal to zero 
Be a public school is attended. Thus, U = 
Ion — P» 29, b)), with Ui, Us, ai, and a 
Ey positive. The achievement function cap- 
‘a es the peer-group effect in our model, dis- 
€ sed further below. To maintain simplicity 
Fad highlight the role of peer groups, a 
F-hool’s quality is determined exclusively by 
eshe mean ability of its peer group.° In ongoing 
Æ ork we are extending the model to include 
jation in educational inputs. 
no schooling. $! 


OU/06 
A >0. 
S utility functig: 


raire consumpti Jence, for students of the same ability, any 
ment of the ho 


the model. 
ical results. 








ndifference curve in the (6, p)-plane of a 
higher-income household cuts any indiffer- 
nce curve of a lower-income household from 


Ps ability and fome elasticity of demand for educational 
body in the schfuality that is positive at all qualities for all 
ter-tax income dypes.’ One set of sufficient conditions on U 
or SCI is U,, = 0 and Up = 0, with at least 
Dne having strict inequality.® 

vn educational achi r 

hot controversial. Hr 

Jent survey. In theg 


hmers and Barbara 
porensen and: Maureen T. Hallinan, 1986; Adam Gamoran 
ns et al. (1992) agd Mark Berends, 1987; Jennie Oakes, 1987; Gamoran, 
f peer groups and s#292). However, there are alternative interpretations 
of the peer group in Robert E. Slavin, 1987, 1990). 

opping out of sch. For simplicity, the possibility that dispersion in peer 
i their results should Šility also affects achievement is not built into our model. 
‘eer-group effects do tbland Benabou (1996b) explores the consequences for 
conomic growth of dispersion in human capital. 


scientific proof of thy’ 1 
that their work supp% We believe this to be uncontroversial. While we know 
les enter the utility fug:89 empirical studies that use direct measures of edu- 
es take place. The f tional quality, a substantial empirical literature on the 
effects in education #™and for educational expenditure exists. Although con- 
sir survey paper, Rick’°table diversity in magnitudes of estimates of the in- 
: (1992) conclude: _PMe elasticity of demand for educational spending are 
Sent, estimates using a’ Variety of approaches find the 
benefit from ability OMe elasticity to be positive (Daniel Rubinfeld and 
nts are harmed by itay Shapiro, 1989). 

ili f Households may consider education a consumption 
pod, an investment good, or a combination of the two. 
er formulation can be interpreted to accommodate any 
E these motives. However, for households not subject to 
Towing constraints, a pure investment motive would 
f the literature ( Sum% ply a zero income elasticity of demand. For such house- 
lds, this in turn would imply that the SCI condition in 


? Would be only weakly satisfied. In light of the empir- 























ort this notion. 
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Preferences for school quality might also 
depend on ability. We say preferences satisfy 
weak single crossing in ability if 


-,{-OU100 » at ee 

a Sui) a =0 
which implies a weakly positive ability elas- 
ticity of demand for quality. However, because 
the pertinent empirical evidence is mixed and 
scarce, we postpone restricting preferences in 
this regard until necessary.” In our computa- 
tional model and to illustrate our more general 
theoretical results, we adopt a Cobb-Douglas 
specification of the utility function: 


(2) U = (y, — p)a(0, b) 


a(0, b) = 07b? 
>00 y>0. 


While (2) satisfies SCI, it embodies the ‘‘neu- 
tral’? assumption of zero ability elasticity of 


demand: 
0U/00 
o( Suge) / = 0, 


Our computational results are not driven by 
own-ability effects on the demand for educa- 
tion. Keep in mind, too, that the theoretical 
results do not assume specification (2). 

A school’s costs depend only on the number 
of students it enrolls, since inputs vary only 
with size. All schools, public and private, have 
the simple cost function: 





(3) C(k) = V(k) +F 


V’'>0O vV">0 


ical evidence suggesting the income elasticity to be posi- 
tive, we conserve space in the development that follows 
by assuming that SCI is strict for all households. 

9 Henderson et al. (1978) find no interaction between 
own ability and the benefits to an improved peer group, 
corresponding to 07U/860b = 0 in our model. Summers 
and Wolfe (1977) find some support for higher peer-group 
benefits to lower-ability students, that is, 07U/000b < 0. 
Thus the literature provides limited evidence from which 
to draw conclusions. 


wA 
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where k denotes the number of attending stu- 
dents. Technical differences among schools are 
not an element of our model (for simplicity). 
Hence, vouchers cannot drive technically inef- 
ficient schools from the market, an effect pre- 
dicted by some proponents of vouchers (see 
footnote 4). Let k* denote the ‘‘efficient scale,” 


(4) ..k* = ARGMIN[C(k)/k]. 


The presumption of some economies of scale 
in education is realistic (Lawrence Kenny, 
1982) and important. Otherwise, the private 
market would produce an infinite number of 
schools containing infinitely refined peer 
groups. Our model’s equilibrium will be con- 
sistent with the fact that the number of types of 
students greatly exceeds the number of schools. 

Public-sector schools offer free admission 
to all students. This open-enrollment policy 
leads to homogeneous public schools in equi- 
librium because we assume no frictions in 
public-school choice are present. Without 
equalization of 6’s in public-sector schools, 
students would migrate to higher- schools to 
reap the benefits of a better peer group. With 
equalized @’s, no incentives for switching 
schools within the public sector remain. We 
study the alternative of neighborhood school 
systems that impose residence requirements in 
Epple and Romano (1995). 

Since all public schools will have the same 
@, one can think of the public sector as con- 
sisting of one (possibly large) school. Public- 


sector schooling is financed by a proportional - 


income tax, t, paid by all households, whether 
or not the household’s child attends school in 
the public sector. Thus, y, = (1 — t)y. The 
public sector chooses the (integer) number of 
schools and their sizes to minimize the total 
cost of providing schooling subject to (3). The 
tax rate adjusts to balance the budget. Because 
households are atomistic, there is no tax con- 
sequence to a household’s decision about 
school attendance. The public finance of 
schooling can then be largely suppressed in the 
analysis until the consideration of vouchers. 


The public sector is passive in this model for _ 


simplicity. Public-sector schools do not seg- 
ment students by ability (track), increase ed- 
ucational inputs to compete more effectively 
with the private sector, or behave strategically 


~ Cobb-Douglas utility, equation (2), students will have ‘ 
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in any way. More realistic alternatives are} 
portant topics for research, some of which 
discussed in the final section. pand for pu 
Private-sector schools maximize profits, profit-maxir 
there is free entry and exit.'° Modeling priv" ,.. 
schools as choosing an admission policy (5) - M 
tuition policy is convenient and involves no l§, ==. “87? 
of generality. Student types are obsérvabli 
implying that tuition and admission can i 
conditioned on ability ‘and ificomé ‘as ‘compet ; 
tion permits.! Private schools are an examp 
of clubs with “‘non-anonymous crowding} 
(Scotchmer and Myrna H. Wooders, 198} 
Scotchmer, 1997) because of the peer-group subject to 
fect, and we model private-school behavior fot 
lowing the literature on competitive cli (5a) œ, 
economies. In particular, private schools maf 
imize profits as utility takers (see Scotchme (5b) U(-: 
1994), a generalization of price-taking whe `` 
consumers (types) and products differ. Priva: 


aith any Qo! 
public schoc 



















SF 







schools believe they can attract any stude% = MA 
type by offering admission at a tuition yieldir, jeto. 
at least the maximum utility the student cow: 
obtain elsewhere. 1 W(b,y 
Let an i subscript, i = 1, 2, ..., n, indica 
a value for the ith private school. A zero m 
script does the same for ‘‘the’’ public ot 
Let p;(b, y) denote the tuition necessary $ (5ce) ki 
enter school i, with po(b, y) = 0 Y (b, y). le 
a;(b, y) € [0, 1] denote the proportion of ty% 
(b, y) in the population that school i admit (5d) 8 
s Constrain 
© Consideration of alternative objective functions the size c 
profit maximization is reasonable, especially given thes ! Mean ab 


nificant proportion of nonprofit schools. Some pe school fr 
schools might, for example, pursue the objective of qual, 
maximization. Quality maximization, like profit maxitg: type or n 
zation, is a member of a set of objective functions thats. ulation. ! 
utility independent in the sense that they place no weipy taking as 
on offering any student types higher utility than the sg limited t 
dent’s (equilibrium) reservation utility. Our preliminzg}: Stud 
analysis of this issue suggests that equilibria where sos% ents 
private schools pursue objectives from this set other the the publi 
profit maximization must also be competitive equili A 
Roughly, the failure of any school to maximize pro; 
would permit entry by a profit-maximizing school. fr 
"' The notion is that abilities can be determined throvyy: 
testing, and required financial disclosures permit del ; „|. Oner 
mination of household income. At least in the e tve schoc 











À fenton i, 








incentive to underperform on exams, since tuition wil 
nonincreasing in ability in equilibrium (proved in Eph. Ping the py 
and Romano [1993]). Incentive compatibility in the 


A dens 
porting of income is more complex. A og 








Beh any ao(b, y) € [0, 1] ‘‘optimal’’ for the 
ome of which$ È plic school as determined by the residual de- 
L. Hand for public education. A private school’s 
cimize profits, $- pt-maximization problem can be written as. .- 
Modeling priv; 
ission policy ¥5 MAX 7; 


4d involves no | J 6 knpi( By) ai(b,y) 


admission can, 2 ff [pi(b, ya; (b, y) 
iq s 


“Wooders, 19% x f(b, y) db dy] — V (k) — F 


} the peer-groupgybject to ` 

chool bebaviori#. 

competitive cij§a) a;(b, y) € [0, 1] Y (b, y); 
ivate schools m#.- 

rs (see Scotchngsb) Uy, — pi(b, y), a4, b)) 
price-taking wi, 

lucts differ. Privg: 

ttract any stud = MAX U(y;— pb, y), a(8;, b)) 
ata tuition yiel i 5 jJZ{2,1,. a] j+ i a j(b,y) > 0 is in the optimal set of j} 
y the student com: 

» V(b. y); 

, 2, .., n, indi S 

chool. A zero sit; 


he” public sch 
A Co k = ff ac, I), y) db ay: 
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tition necessary}; 


)=0Y(b,y) E 
b proportion of if 
. d i. 7 
at school i a Sd) 9, -iff ba;(b, y)f(b, y) db dy. 


onstraints (5c) and (5d) define, respectively, 
objective functionbe size of the school’s student body and the 
ee lean ability. Constraint (5a) precludes a 
P“ichool from admitting a negative number of a 


ie the objective of quay hee 
tion, like profit maxi¥P€ Or more of a type than exits in the pop- 


jective functions that Hlation.'? Constraint (5b) imposes the utility- 
one no Aig ng assumption. Students’ alternatives are 
aes nany man ne fmited to schools where they are admitted. 


utility. Our prelimif f : 
„equilibria es students always have the option of attending 


; from this set other He public school. It is innocuous: to-require 
2 competitive equilil® se 

ool to maximize pry i 

aximizing school. ¥ 
in be determined thro 
iisclosures permit do * One might object to the presumption that ‘‘compet- 
At least in the caseiVve schools” recognize the limit to demand. The pre- 
2), students will haveg@ption is analogous to a monopolistically competitive 
ums, since tuition willf@’s recognition of a limit on its demand curve. Drop- 
ibrium (proved in Eggs the presumption would lead to schools admitting in- 













EPPLE AND ROMANO: PRIVATE-PUBLIC SCHOOLS COMPETITION 39 


that (5b) hold for all (b, y) as we have spec- 
ified (i.e., including for nonadmitted stu- 
dents). Tuition charged to students for whom 
a;(b, y) = 0 is school i’s only optimal choice _ 
(i.e., nonadmitted students) is irrelevant. Note, ` 
too, that tuition such that (Sb) holds with strict 
equality will be optimal. 

Private schools enter so long as they expect 
to make positive profits as utility takers. Be- 
cause incumbent private schools maximize 
profits as utility takers, entry results if and only 
if m, > 0 for some incumbent school. The 
public-sector/private-sector equilibrium is de- 
scribed by the following five conditions in 
addition to the government balanced-budget 
condition presented below in Section I, sub- 
section C, for the more general case with 
vouchers. 

Condition UM: 


U*(b, y) 


= MAX U(y, — p;(b, y), a(6;, b)) 


iE {0,1,... n |æ;(b,y) > 0 is in the optimal set of i} 
v (b, y). 
Condition TIM: 
T0: , ki, pi(b, y), a:(b, y) ] satisfy (5), 


Condition ZIT: 
mT; =O i=1,2,..,n. 
Conditions PSP: 


Polb, y) = OV (b, y) 


s ty i kam 


ao(b, y) € [0, 1] V(b, y) 


ko = | f aoto, VF, y) db dy 


Ss 
1 
o ar bay(b, y) f(b, y) db dy. 
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Condition MC: 
È a;(b, y) = 1 V(b, y). 
i=0 


Condition UM summarizes household util- 
ity maximization. Households choose a most- 
preferred private or public school, taking 
admission/tuition policies, school qualities, 
and taxes as given. Profit maximization of pri- 
vate schools (LIM) and the public-sector pol- 
icies (PSP) have been discussed. While the 
entry assumption above is formally part of the 
definition of equilibrium, it is convenient to 
substitute the implication that private schools 
must earn zero profits (ZII). The last condi- 
tion is market clearance, which uses the sim- 
plifying assumption above that free public 
schooling is preferred to no schooling. 


II. Theoretical Results 
A. Solution to the Private School’s Problem 


Using UM, the first-order conditions for 
problem (5) can be written as follows: 


(6a) U(y, — pi, a(9;, b)) 
= U*(b,y) V(b, y); 

=0 

(6b) a;(b, y)\ € [0, 1] 
et 
< 

asp? (b, y, 8; ) = V'(k;) 

> 


+7(8-b) Vib, y); 


1 Opi (b, y, 6:) 
(6c) q= i JJ [Piat a (0, y) 


x f(b, y) db ay |. 


Condition (6a) describes school i’s optimal tu- 
ition function, př (b, y, 6;) and is just (5b) 


` change in’school i deriving from a change: 
















with equality combined with the equilibr; 
condition UM; př (+) is student-type (b, y 
reservation price for attending school of q 
ity 6,;. Condition (6b) characterizes opti 
admission policies.'* The term 7; (6; — b) me 
be thought of as the marginal cost of admissif’¢-hools m 
operating via the peer-group externality #¢chools in ` 
school i. From (6c), n; [the Lagrangian mě gents wou 
tiplier on (5d)] equals the per-student reve ‘private sct 
» Why m 
§;. The appropriately scaled change in 6; dy schools ct 
to admitting student of ability b equals (b¥ yate schot 
6; ); its negative is then multiplied by n; tog! they wou. 
tain the peer-externality cost. The peer costy: Conseque: 
admitting students with ability below t? fective mi 
school’s mean is positive because the resulti and their 
quality decline dictates reduced tuition to # would eq 
students, while the peer ‘‘cost’’ of admitth¥ portunity 
above-mean-ability students is negative. Ig varying a: 
MC;(b) = V’ (ki) + 7;(8; — b), which ve either: (a; 
term effective marginal cost. Types with 1e& body that 
ervation prices below MC; (b) are not willivs crease qu 
to pay enough to cover their effective margin® 


cost and are not admitted. The school admi¥: school di 
all of a type that has a reservation price abo} same tim 
effective marginal cost, and any a; € [0, 1]§ mits prot 
optimal if př = MC;.'* quality ci 
p We sk 

B. Properties of Equilibrium E ity impre 

p ing ident 


We now turn to the properties of equillf that this : 
rium, assuming one exists. Existence issu, school ac 
are discussed below. Heuristic arguments hai ás it exp 










been substituted for formal proofs whig, and y, > 
reasonable. 4 change i 
The first result concerns the qualities %, ther, che 
schools. a sible) sv 
fF, that, usj 

PROPOSITION 1: A strict hierarchy &%, ‘teased 
school qualities results, with the public sect: types, e 


from (6a) into (5), and then forming a Lagrangian 
tion to take account of (5c) and (5d). Result (6b) is 
derived by pointwise optimization over a; while t: 
account of the constraint (5a). 


first-order conditions, V” sufficiently large implies | 
maximization. 
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A ing the lowest-ability peer group: 0, > 
fy.. >- > 8, > b. 

i formal proof is in the Appendix. Here an eco- 
) omic interpretation is provided. All private 
isi#éshools must be of higher peer quality than 
p externalityÉchools in the public sector. Otherwise, no stu- 
Lagrangian m Hents would be willing to pay to attend any 
private school. 


hey would compete perfectly for students. 
i peer cosfConsequently, they would have the same ef- 





vation price abo Rame time attracting a student ey that per- 
any a; € [0, limits profitable price discrimination over the 
uality change. 

} We sketch the example of a profitable qual- 
ty improvement, beginning with schools hav- 
g identical student bodies (the proof shows 
itbat this is without loss of generality). Let one 
schoo] admit the same number of (bz, y2) types 
#S it expels of (b, yı) types, where b, > b, 
fnd y, > Yı, implying an increase in @ but no 
hange in production costs, V(k) + F. Fur- 
her, choose the types (which is always fea- 
fible) such that y, — y; > bı — b, by enough 
hat, using SCI, the (bz, y2)-types value in- 
eased quality by more than the (b,, yı)- 
ypes, even though their abilities differ. This 
ermits the school to charge the newly admit- 
ed students tuitions higher than their effec- 
ve marginal costs because they are selected 
ind by substituting. value quality increases by moré than the 
4). Sener an pelled students. The profit increase occurs 
„over a; while tarC4use the new student body values the qual- 
Increase by more than would the original 
f (6b), the solutiostudent body; ð and 7 rise in the school. It 
a Scar ore not increase profits to substitute stu- 
) € [0, 1] satisfies Fits in such a way that 8 rises without also 
tly large implies of 28ing the student body’s average value of 
‘ality improvements, because tuitions equal 


ict hierarchy 
1 the public sec 


re 
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effective marginal costs in the initially non- 
differentiated schools.'° This example as- 
sumes that a school substitutes students to 
increase quality, but alternative profitable sub- 
stitutions exist-that.decrease quality, roughly, 
by also creating a lower-income student body. 

In either case, the argument depends on SCI. 
It also identifies the model’s force for ‘‘diag- 
onal stratification’’ (see the examples in Fig- 
ure 1). As developed more fully below, this 
stratification results because students having 
relatively high income and low ability within 
a school cross subsidize relatively low- 
income, high-ability students. 

The strict hierarchy of Proposition 1 sup- 
ports the equity-related concerns of some that 
private schools operate to the detriment of 
public schools by siphoning off higher-ability 
students. Whether a strict hierarchy is efficient 
is analyzed below. First we develop further the 
positive properties of equilibrium. 

Proposition 2 describes equilibrium pricing, 
and Proposition 3 describes the resulting par- 
tition of types. Some definitions are useful. Let 

= {(b, y) E S|a;(b, y) > 0 is optimal } 
denote the admission space of school i, i = 0, 
1, ... , n (see Figure 1, for example). A locus 
of points (b, y) E A; N A, i + j, assuming 


` it exists, is referred to as a boundary locus be- 


tween i and j. (Boundary loci have zero mea- 
sure in S, as proved in Epple and Romano 
[1993].) Since any household prefers free 
public schooling to no schooling, the entire 
type space S is partitioned into admission 
spaces. Last, to avoid tedious qualification of 
statements for public-sector schools, we spec- 
ify that MC, = 0 for all (b, y). This notation 
is convenient since students see a zero cost of 
public education. 


PROPOSITION 2: (i) On a boundary locus 
between school i and į, p; = MC;(b) and p; = 
MC,(b); pricing on boundary loci is strictly 
according to ability in private schools. (ii) 
pi(b, y) > MC; (b) for off-boundary students 
who attend private school i; pricing off- 


'S Mathematically, beginning with equal 6’s, first-order 
effects on profits of varying admissions vanish, but the 
profit function is convex in some directions in [a(b, 
y), p(b, y)]-space, allowing a profit increase. 
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boundary loci depends on income in private 
schools. (iii) Every student attends a school 
that would maximize utility if all schools in- 
stead set p; equal to equilibrium MC; for all 
students. The allocation is as though effective 
marginal cost pricing prevails in private 
schools.'® ` 


Sée Epple and Romano (1993) for proof. 
Competition between private schools that 
share a boundary locus forces prices to effec- 
tive marginal costs for student-types on the lo- 
cus. These students are indifferent to attending 
the schools sharing the locus. Private schools 
then have no power to price discriminate with 
respect to income on boundary loci. Prices are, 
however, adjusted to differing abilities be- 
cause private schools internalize the peer- 
group effect. Tuition to private school i 
decreases with ability at rate 7; along its 
boundary loci, reflecting the value of peer- 
group improvements of the school’s student 
body. 

Moving inside a boundary locus in a private 
school’s admission space, students’ prefer- 
ences change in such a way that they would 
Strictly prefer the school attended if it prac- 
ticed effective marginal-cost pricing. Part (ii) 
of Proposition 2 establishes that private 
schools exploit this by increasing price. These 
students are also indifferent between the pri- 
vate school attended or their best alternative 
by (6a), but this is a result of discriminatory 
pricing. Generally, then, price depends both on 
ability and income within admission spaces.’’ 

Part (iii) of Proposition 2 follows because 
it is profitable for a private school to be sure 
to attract any student whose reservation price 


16 The statements regard the equilibrium effective mar- 
ginal cost. Income effects would cause these costs to 
change if tuition equaled effective marginal cost for all 
students. This has distributional (but not efficiency) 
implications. 

17 While there are no published studies of the allocation 
of financial aid by income and ability among private ele- 
mentary and secondary schools, there is evidence on the 
allocation of financial aid by colleges and universities. 
There the evidence is that both ability and family income 
are significant determinants of whether and how much fi- 
nancial aid is received (J. Brad Schwartz, 1986; Sandra 
R. Baum and Saul Schwartz, 1988; Charles T. Clotfelter, 
1991). 


















exceeds the school’s effective marginal cogg: fh: g” 
The student allocation’s link to effective mą} equilibrium.” 
ginal costs, and hence abilities, will be show. 
to be efficient (except for the public secto} `. 
The income-related price discrimination thf: poids with stude 
occurs does not disrupt the allocation consi% fering incomes 
tent with effective marginal-cost pricing} jpdifference cur 
rather, it is purely redistributive. $ sloping. For the 
While this income-related price discrim} any indifferenc: 
nation is of the first degree (a la Pigou), itt encé curve of > 
magnitude is limited by competition for st gs if tuitions € 
dents among the differentiated schools. Neanf [part (iii) of P: 
boundary in a school’s admission space, a styf between schoo 
dent’s preference for the school attendat the (8, p)-ple 
would be slight under effective marginal-cog! MC; (b)) and ( 
pricing, so that the admitting school can cap} be that MC;( 
ture little rent. The number and sizes of privat chooses i. A 
schools then determine their power to pricf ment then app 
discriminate over income. All private schook} ` Part (ii) is 
have student bodies less than k* by a simil} we provide so 
argument to that in more standard monopolis the demand * 
tically competitive equilibria.'* Here schod} ability (¢.g., 
i’s marginal-revenue curve can be constructei fication) and 
by ordering from highest to lowest student’; same discour 
reservation prices minus peer costs [i ary loci (i.e. 
pi + 7;(b — 6;)], and thus the associate} Holding non 
downward-sloping average revenue curve maj real income 
be derived. Zero profits then implies a scal% due to tuition 
below k*. If we let k* decline, then privat SBA would t 
schools become more numerous and less diti plaining SB] 
ferentiated (have closer 6’s), and income} positive inc 
related price discrimination declines. f counts to abi 
Now consider the partition of types intf and SBA, t) 
schools. We say stratification by income (SBIẸ Figure 1. Rs 
holds if, for any two households having sty ability stud 
dents of the same ability, one households! low-income 
choice of a higher-§ school implies it has if Private schc 
weakly higher income than the other housek Strongly if ` 
hold. Analogously, stratification by ability, SCB holds 
(SBA) is present if, holding income fixed, tk dition is ne 
household that chooses a higher-6 school musk, Ur other re 
have a student of weakly higher ability. Thy: SBA. It ma 
combination of SBI and SBA implies a diag} ditions to ; 
onalized partition as, for example, in Figure! boundary 1c 


PROPOSITION 3: (i) SBI characterizé 
equilibrium. (ii) If preferences satisfy wealk 
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single crossing in ability (W-SCB) and n, 7% tinier ad 
£ Oinvestigate 
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18 The points made here are proved in Epple É are willi 
Romano (1993). alternative 
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- a aus fe ts r 
Š To confirm part (i), consider two house- 
$: holds with students of the same ability but dif- 
ition conii fering incomes y? > y;. In the (8, p)-plane, 


ost pricing F indifference curves of a household are upward 
W sloping. For the same ability, SCI implies that 

[ce discringg ny indifference curve of y? cuts any indiffer- 
Pigou), Pence curve of y} from below. Allocations are 
tion for sas if tuitions equal effective marginal costs 
ools. Neak [part (iii) of Proposition 2]. Thus, the choice 
Space, a t between schools i and j may be represented in 


HMC, (b)) and (6;, MC;(b)). If 8; > 0; , it must 
abe that MCj(b) > MC; (b) if either type 
“chooses i. A standard single-crossing argu- 
iment then applies to complete the proof. 
rivate scho; Part (ii) is proved in the Appendix; here 
i we provide some intuition. Assume first that 
rd monopoli the demand for quality is independent of 
* Here schofability (e.g., as in the Cobb-Douglas speci- 
# fication) and that all private schools give the 
west studensame discount to ability along their bound- 
itgary loci (i.e., schools’ 7’s are the same). 
@Holding nominal household income fixed, 
real income would rise with student ability 
kpss a scidue to tuition discounts at all private schools. 
te, then privÆSBA would then result by the same logic ex- 
gpositive income elasticity (SCI) and dis- 
(gcounts to ability alone would cause both SBI 
imgand SBA, the diagonalized partition as in 
#Figure 1. Relatively high-income and low- ` 
blds having sigability students cross subsidize relatively 
me householi#low-income and high-ability students in 
implies it hasgP"vate schools. The argument holds more 


the other ho astrongly if the 7’s strictly ascend or if W- 
ation by abilite 


iKSCB holds strictly. However, neither con- 
ncome fixed, edition is necessary for SBA, nor do any of 
1er-9 school mi oa other results require these conditions or 
gher ability. OBA, It may be possible absent these con-.. 


A implies a diajfitions to get cases having nonmonotonic 
boundary loci in the (b, y)-plane.”° 










-SCB) and n$ O We thank an anonymous referee for encouraging us 
i: oga agate bid-rent functions (see e.g., Masahisa Fujita, 
E l 5289), which ultimately led to part (ii) of Proposition 3. 


; a j The alternative to W-SCB implies that lower-ability 
are willing to pay more for a better peer group, and 


proved in Epple g F ; 
dks alternative to weakly ascending 7's implies that lower- 
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We now turn to normative results which are 
quite intuitive. Again, see Epple and Romano 
(1993) for the formal analysis. Pareto effi- 
ciency requires: (i) a student allocation that 
internalizes the peer-group externality given 
the number of schools, and (ii) entry as long 
as aggregate household net willingness to pay 
for an allocation with one more school exceeds 
the change in all schools’ costs. An equilib- 
rium without a public sector would satisfy 
condition (i) but not condition (ii). Effective 
marginal cost includes the marginal value of 
the peer group externality, implying that 
MC; (b) equals the social marginal cost of at- 
tendance at school i by a student of ability b. 
A purely private-school equilibrium then sat- 
isfies efficiency condition (i) by part (iii) of 
Proposition 2. 

However, entry to the point of zero profits 
entails externalities so that efficient entry 
[condition (ii)] fails to hold in a fully private 
equilibrium. An entrant captures the full value 
of its product to the student body it admits but 
ignores utility changes of nonadmitted stu- 
dents and profit changes of other schools re- 
sulting from the reallocation.” Fixed costs, 


quality schools give bigger discounts to ability. Either 
would tend to work against pure ability stratification, 
though Proposition 1 implies that some degree of ability 
stratification would be present. It is desirable to demon- 
strate SBA without assuming ascending 7’s, since these 
values are endogenous. However, providing general prim- 
itive conditions for SBA independent of assumptions con- 
cerning the equilibrium 7’s is difficult, because their 
equilibrium values depend on the entire distribution of 
types in the population. For the Cobb-Douglas case and 
assuming independence of income and ability in the pop- 
ulation, we (Epple and Romano, 1993) have shown SBA 
without assuming weakly ascending 7)’s. 
2! The comparison of the equilibrium number of 
schools in a fully private equilibrium to the Pareto- 
‘efficient number entails „a, trade-off. The entrant ignores 
the lost revenues and cost savings to other schools from 
the students that it admits. Since almost every student at- 
tracted away from incumbent schools is inframarginal 
(i.e., tuition exceeds effective marginal cost), the net ef- 
fect here of entry is negative, tending to cause too much 
entry. Opposing this is the entrant’s failure to capture the 
full returns from increased variety of school qualities that 
results, Although the entrant fully price discriminates to 
the students it admits, it cannot tax other students for the 
adjustments in the incumbent schools’ qualities. A net 
benefit to other students is likely to result because the in- 
cumbent schools will better accommodate preferences. 
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hence the finite size of an entrant, underlie the 
entry externalities as in many models of mo- 
nopolistic competition. 

Introduction of the free public sector implies 
deviations from both efficiency conditions. In 
general, the public sector displaces multiple 
differentiated private schools, substituting the 
equivalent of one -‘‘large’’ homogeneous 
school. This effective reduction in the number 
of schools is without attention to costs and 
benefits, generally implying a deviation from 
efficiency condition (ii). 

Holding fixed the number of schools in the 
public—private equilibrium (and counting the 
public sector as one school), zero pricing of 
public schooling violates condition (i). By 
just reallocating students between the public 
sector and private school 1 near their shared 
boundary locus, Paretian gains are feasible. 
Reference to the upper panel of Figure 1 from 
our computational equilibrium may help clar- 
ify the argument. Gains would result from 
shifting into private school 1 relatively lower- 
ability students below but near the boundary 
locus, students for whom the marginal social 
cost in the public sector is positive. These stu- 
dents are nearly indifferent between the two 


schools when facing the social cost of attend- 


ing the private school but a tuition (zero) be- 
low the social cost of attending the public 
school. Students near the boundary locus and 
attending the private school may also be of 
sufficiently high ability that the social ‘‘cost’’ 
of attending the public school is negative. 
Gains from shifting such students into the pub- 
lic sector are then also feasible. Such students 
exist in our computational model, the rough 
prescription being to rotate the boundary locus 
counterclockwise at the point of ability having 
zero social marginal cost in the public school. 
Collecting these results, we have the following 
proposition. 


PROPOSITION 4: (i) The allocation in a 
fully private equilibrium is (Pareto) efficient 
given the number of schools; the equilibrium 
-- number of schools is not, however, generally 


This positive externality will tend to cause too little entry. 
We believe that too many or too few private schools are 
possible, but we have not proved this. 


--equilibrium will be correspondingly sm 





efficient. (ii) The public—private-sector egy: 
librium has neither an efficient number fi; zn; 
schools, nor an efficient student allocatigg: ublic sect 
given the number of schools. Kation. Vi 
; vate educa! 
When fixed costs of schooling are small, t we exami 
departure from efficiency in a fully privy ; computatic 









Part (i) of Proposition 4 can then be inf D. Æ 
preted as making a case for private schooling Z 
and the vouchers we study. However, we hay? 

some reservations concerning this efficieng / Aswed 


result. First, we are sympathetic to the viewgt jjbrium ge 
many that access to a quality education is} te teger num! 
right and serves as a means to limit historia? an approx: 
inequities. Second, longer-run externalitigg tional an: 
from education not considered by privat} equilibriur 
schools, like reduced crime, may be presen} private sct 
For these reasons, we explore the Conse} crease pro: 
quences of vouchers on -all types instead dt denote th: 
just providing aggregate measures. A some. earned by 
what distinct concern arises because exa mize profi 
equilibrium exists only in special cases. The and replac 
interpretation of the efficiency results in tht with 

approximate equilibrium we study is discussa 
in subsection D, below. 
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4B tion. Vouchers lower the real price of pri- 
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g this efficient’, 
‘tic to the view 
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ered by privteguilibrium’” requires that no (utility-taking) 
may be prese#rjvate school, incumbent or entrant, could in- 


‘eared by incumbent schools [which maxi- 
nize profits over p(b, y) and a(b, y) locally], 
ind replace ZII in the definition of equilibrium 







MAX[ Tmax; Nmax ~ Amin» — Tmin] = €. 


Here Tmax equals the maximum potential prof- 
ts to an entrant, and the maximum of the sec- 
cash awards tond two terms in the brackets equals the 
ool.” No role largest feasible profit increase by an incum- 
tuition-free pubbent school. The revised definition of equilib- 
del by everywhiium continues to require UM, PSP, MC, and 
oucher, v, to y,flocal profit maximization by incumbent pri- 
rivate school. Wate schools [i.e., (6a) —(6c)]. Last, the num- 
aint is: t of private schools is the minimum number 
atisfying these requirements. 










ve properties of an exact equilibrium except 
hat private schools could gain € in profits via 
obal adjustments. The allocation of students 
a fully private equilibrium would then con- 
ue to. satisfy efficiency condition (i). 
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Tilustrative Results 


| We develop a computational model to illus- 
ile tonsa aoe te our results, to examine vouchers, and to 
nover the nition paid Plore issues for which comparative-static 
ereby avoiding consid: aly Sis may yield ambiguous results. We cal- 
rate it to existing empirical evidence so that 
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: The epsilon equilibrium retains all the pos- 
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the results will provide at least suggestive ev- 
idence about the impact of policy interven- 
tions. However, scant empirical evidence 
exists on some important parameters of the 
model. 


A. Specification and Calibration 


We require specifications for the density of 
income and ability, the utility and achievement 
functions, and the cost function for education. 


We assume that E is distributed bivar- 
ln(y) 


Ho 


Hy 


iate normal with mean | | and covariance 


matrix 


c? PITy 

Poy o? l 
To calibrate the distribution of income, we use 
mean ($36,250) and median ($28,906) in- 
come for households from U.S. census data for 
1989. With units of income in thousands of 
dollars, these imply that u, = 3.36 and 
oy = 0.68. 

We adopt specification (2) for the combined 
utility-achievement function. To calibrate the 
ability distribution we presume that educa- 
tional achievement determines future earnings 
and that the benchmark economy is in a steady 
state. First, define normed achievement, ay, 
as our achievement function raised to the 
power 1/ and multiplied by a constant, ay = 
Ka"? = K07/*b.™ Then, a student with ability 
b attending a school with a peer quality of 6 
is presumed to have future annual earnings (E) 
given by In E = Inay=InK + (y/f)In@+ 
In b. This normalization is such that a per- 
centage change in ability leads to the same per- 
centage change in dollars earned. Henderson 
et al. (1978) report the change in achievement 


“percentile that results from moving: students ... 


from classes stratified by ability to mixed 


2 The constant of proportionality, K, is arbitrary. A 
convenient scaling is to set K = E{b]~*’*. This scaling 
has the property that, if all students in the population were 
to attend the same school (i.e., 9 = E[b]), then normed 
achievement would equal ability (i.e., ay = b). 
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classes. An elasticity of achievement with re- 
spect to peer ability that is 30 percent as large 
as the elasticity with respect to own ability is 
representative of the results they report. We 
adopt the somewhat conservative value of y/8 
= 0.2. To complete the calibration of the dis- 
tribution of ability, we then assume that the 
observed household-income distribution is the 
income distributjon that emerges in a steady- 
state equilibrium in our benchmark model.” 
This yields u, = 2.42 and o, = 0.61. Thus, 
mean and median ability are 13.6 and 11.3, 
respectively, and the standard deviation of 
ability is 9.1.7 

Gary Solon (1992) and David J. Zimmerman 
(1992) provide evidence on the correlation be- 
tween father’s income and son’s income, and 
they both find that the best point estimate of 
this correlation is approximately 0.4. Intergen- 
erational correlation in income arises from two 
sources: correlation between household income 
and student ability and, for given ability, cor- 
relation between income and quality of school 
attended. Hence, SBI suggests that the inter- 
generational correlation in incomes is an upper 
bound on the correlation between parent’s in- 
come and child’s ability. For purposes of sen- 
sitivity analysis, we then assume that p € [0, 
0.4]. For our benchmark case, we set p = 0, 


*4 More precisely, we let the distribution of ability be 
lognormal, and we approximate by assuming that this gen- 
erates a lognormal distribution of earnings. We set the first 
two moments of the distribution of earnings equal to the 
first two moments of the distribution of income. That is, 
we choose ju, and g, such that our benchmark equilibrium 
has Ela@y] = Efy]/m and Var[an] = Var[y]/:?. The 
constant m is the ratio of employed workers per household 
to the number of students per household (m = 2.6 in 
1990). The distribution of earnings will not be exactly 
. lognormal because of the discrete difference in schools 
attended, even though the distribution of ability is pre- 
sumed to be lognormal. If every student attended public 
school in the benchmark model, and hence faced the same 
, earnings would be exactly lognormal. The approxima- 
tion is a good one because 90 percent of the students do 
attend public schools as we will see. 

%3 Ability can be related to IQ. Using IQ ~ H(100, 
256), one obtains In b = —1.38 + 0.038(IQ). In our no- 
voucher steady state, this implies that a worker with an IQ 
of 100 has expected income of $22,074, and a 10-point 
increase in his IQ increases expected income to $32,510. 
See the discussion in what follows relating to Figure 6 and 
the calculation of expected steady-state income condi- 
tional on ability. 
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which is particularly convenient for our steag} -j 
state calibration of the model. This compleigt: the benchm 
the calibration of f(b, y). | are sensitiv 
We now complete the calibration of prep}. cost of sche 
erences. The Cobb-Douglas specification inf” we set € 
plies unitary price and income elasticities ft sufficient tc 
school quality, 6. Given the absence of empi librium for 
ical evidence on the demand for quality, tregt io $4,200 p 
are plausible focal values and are consister! 
with estimates of demand for school expen 
diture (see e.g., Theodore Bergstrom et al} 
1982). This function also implies that the maf For our 
ginal rate of substitution between school qua} yoycher, tl 
ity and the numeraire is invariant to Ow} the studen 
ability. Empirical evidence is mixed abou}. schools co 
whether an improvement in peer group is morf actual U.S 
beneficial to high- or low-ability students} public sch: 
Hence, our model’s assumption that the effeg} percent. Ir 
of peer group is not biased toward either high? public-sec’ 
or low-ability types seems an appropriatl fects on ot 
choice for a baseline model. If school qualit? also small 
could be purchased at a constant price per uni p = 0, 
of quality, each household’s expenditure of; 
education relative to total expenditure on othe} 
goods would be y/(1 + y). The existing shar} 


































of aggregate disposable personal income in th centage of h 
United States that is spent on education is apf peice 
proximately 0.056. Hence, we set y = OUP Tere aren 

Using y/8 = 0.2 from above, the calibrate United State 

utility—achievement function is then k substituting 

f average cost 

Z = 0.06 7, 0.30 V focus on pe 

“= 6 PREES } telated per-} 

Pa , UR the per-stud:i 

We chose a cost function that is quadrati: "This v: 

in the percentage of students (or households} operating at 
a school serves: g alive to fix: 

Course, a mi 

2 È have studiet 

F + V(k) = 12 + 1,300k + 13,333k*, fF cient schoo 

K, Sant. We fi 

with parameters set as follows. Expenditurg, kem 

per student in public schools in 1988 wsk: wold expa 

$4,222 (Statistical Abstract, 1991 p. 434) aly is made suf 
there was '/, student per household ( Statistico, Teasing fix - 

Abstract, 1992 pp. 46, 139). We specified off, FFF cost co 

benchmark case to have four private schoot! gin sost è 

a to si 

and chose parameter values such that average that substa 
cost in equilibrium was approxiniately $4,205 Re not sens” 

per pupil.*° Experimentation indicated thé. tude of fi. 


ko; Sts is the 
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: ailibrium properties are not very sensitive to 
> benchmark number of schools, but rather 
= sensitive to the minimum of the average 
TERRE of schooling. E 

ecification į f We set £ = 4.2. This is the minimum value 
elasticities (ir ee cient to assure existence of epsiloñ equi- ` 
ence of CMsrium for voucher values varying from zero 
r quality, th: ¢4 200 per student.” 

are consistg 


school expe B. Results 

rgstrom et 

es that then or our benchmark equilibrium with no 
len school qui‘ ucher, the public sector has 90 percent of 


the student population, and the four private 
ib mixed ab¥chools combined serve the remainder. The 
tual U.S. percentage of students enrolled in 
iblic schools during this period equaled 88 
m that the eff®sercent. Increasing p from 0 to 0.4 reduces 
zard either higfyblic-sector attendance to 88 percent. Ef- 
an appropri&fects on other variables of so changing p are 
[f school qualffjiso small, and the results that follow are for 
ant price per w$ = 0. 
oe 
expenditure @;. 
enditure on o 
he existing sh : 
nal income in {eentage of households served, k. In terms of k, average 
| education is cost reaches a minimum at $2,100, with k* = 0.03; $2,100 
__ nén then be interpreted as the average cost per household. 
e set y = 0U 
Teas Fihere are twice as many households as students in the 
ve, the calibrat Únited States. Letting s denote the number of students and 


is then fubstituting s = k/2, one sees that the minimum of the 
average cost per student is $4,200. In our presentation, we 
640.30 {focus on per-student measures of tuition and costs; the 


elated per-household measures are simply half those of 
per-student values. 
This value is about 7 percent of the cost of a school 
perating at a scale that minimizes cost per student. Rel- 
lative to fixed cost, e is approximately 35 percent. Of 
kourse, a minimal £, however measured, is desirable. We 
‘Have studied how the minimum e varies as we vary effi- 
Rient school scale, k*, while holding average cost con- 
fant. We find that the requisite s to support equilibrium 
lows. Expendi paries approximately proportionately with k* if fixed cost 
ools in 1988 8 varied proportionately with k*. This suggests, as we 
Would expect, that s can be made as small as desired if k* 
f, 1991 P. 434) "i£ made sufficiently small. We have also investigated in- 
asehold (Statisti@teasing fixed cost while holding k* and minimum aver- 
}- We specified ¢8° cost constant. This tends to reduce the ratio of € to 
>ur private scho i but rede aoe magnitude of € oe 
Wired to sustain equilibrium. Our investigation reveals 
S such that averia substantive findings from the EEEE model 
proximately $4,24¢ not sensitive to the choice of k* or the relative mag- 
ion indicated 


1 that is quadr 
s (or householé 





Dk + 13,333k?, 





tude of fixed to variable cost. Rather, the key aspect of 
i ts is the value of average cost at the minimum, and as 
i fussed in the text, this value is based on observed 
‘f200l costs, The problem with pursuing a calibration that 
er lowers e is that it leads to a computationally un- 
ageable number of schools for large vouchers. 


function in terms of! 
i equivalently, the PS 
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Other computational results are presented in 
Figures 1—6. The upper panel of Figure 1 pres- 
ents the boundary loci and admission sets in 
type space, in addition to the equilibrium @’s 
and k’s. Here and in some other figures, both 
absolute and percentile ability scales are pro- 
vided for perspective. The lower panel dis- 
plays the allocation for a voucher of $1,800. 
The linear boundary loci derive from the 
Cobb-Douglas specification. For results we 
present, intersections of boundary loci, if any, 
occur very near the bounds of the support of 
type space. Such intersections are insignifi- 
cant, but we have encountered cases having 
only a piecewise linear public—private bound- 
ary due to meetings of loci well interior to type 
space. In such cases, some (high-income) stu- 
dents in public-sector schools have other than 
the lowest-@ private school as their best 
alternative. 

In addition to illustrating the strict hierarchy 
of schools, the data in Figure 1 show a nega- 
tive correlation between school qualities and 
school size in the private sector. Those who 
attend lower-quality private schools face 
closer substitutes, flattening the derived aver- 
age revenue curves (discussed above) of these 
schools and increasing their size in equilib- 
rium. The negative correlation of school qual- 
ity and school size is a testable prediction, and 
it seems plausible that the most elite schools 
are smallest. 

Figure 2 shows effective marginal costs for 
the four private schools in equilibrium with no 
voucher, truncated at the maximum ability of 
students attending each school. For students 
on the admission boundaries of their schools, 
tuition rates equal effective marginal costs. 
While price discrimination by income as well 
as ability is practiced on the interior of 
schools’ admission spaces, price is close to 
marginal cost for almost all students. Hence, 
as a first approximation, one can interpret the 
marginal-cost functions in Figure 2 as price 

` functions: High-ability students are.seen tore- 
ceive a tuition discount (financial aid) in all 
schools. In addition, students of sufficiently 
high ability pay negative tuition (i.e., a tuition 
waiver plus a stipend) in the top three schools. 
For our calibration, the standard deviation of 
ability is about 9 units. The results imply tui- 
tion reductions of roughly $2,350 and $3,240 
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for a one-standard-deviation increase ih ability ` 
in schools 1 and 4, respectively. 

The top panel of Figure 3 shows the ability 
distributions in the public- -school sector and in 
the four private schools in the computed equi- 
librium. These distributions, all normalized to 
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FIGURE 2. MARGINAL-CosT FUNCTIONS WITH FOUR PRIVATE SCHOOLS 









$; ability types attending public school is illus- 
N ‘trated in the bottom panel of Figure 3. To il- 
$ lustrate the effect of the voucher program on 
È public-school attendance of students of vari- 
kous ability levels, we have not normalized 
Mi these distributions to have an area of 1. Thus, 
mthey represent the ‘‘numbers’’ of students at 
Byeach ability level that are served by the public- 
$ school sector. 

ES Figure 4 illustrates several ways in which 


P equilibrium changes with the voucher. The up- 


"DK 



















per panel shows the decline.in the proportion 
pi of students attending public school as the 
svoucher increases. The curve showing size 


during ‘‘transition’’ assumes the parental- 


¥/ 


Huy : . 
cussed above. When a voucher policy is 





: introduced or varied (i.e., during transition), 
ihe income distribution of the parent genera- 
‘mn will typically differ.from the income dis- 


pihe new voucher changes the distribution of 
Wecamings of a generation relative to that pre- 
ceding. By contrast, in steady state, the income 


t of vouchers4 Bidistribution is unchanging from generation to 
a Becneration. To obtain the steady state for each 
dent, $4,200. : alue of the voucher, we hold the ability dis- 


x i + . : 
ithe distribution $ ution fixed. We find the parent-income dis- 


Wtibution of the succeeding generation becausé ` 


P 


tribution that leads to an earnings distribution 
of the next generation that replicates the in- 
come distribution of the parents. The size of 
the public sector is virtually the same in tran- 
sition and steady state (Figure 4), as are other 
variables that can be compared across the 
cases. To conserve space, other results pre- 
sented are for the transition except when stated 
otherwise. 

As expected, the size of the public-school 
sector declines to zero when the voucher nears 
the minimum average cost of providing edu- 
cation. The number of private schools at each 
voucher value is also shown in the top panel 
of Figure 4, illustrating the entry of new pri- 
vate schools as the voucher is increased. While 
the graph shows a continuous approximation, 
the number of schools is, of course, an integer 
at each voucher level in our computational 
model. We also calculated for each voucher 
level-the percentage of households that favor 


that voucher level as compared to a voucher ``” 


of zero. As the top panel of Figure 4 shows, 
support for a voucher program increases with 
the voucher but never reaches a majority. For 
example, only about 31 percent of the popu- 
lation benefits from a $2,000 voucher, and this 
includes the 10 percent who attend private 
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school with no voucher. A theme of our com- 
putational findings is that, while gains from 
vouchers result on average, there is a majority 
with relatively small loses and a aniihority with 
relatively large gains. 
The bottom panel of Figure 4 also shows the 
` per-student welfare gain, compensating varj- 
ation plus net profit change (the latter rela- 
tively small), associated with the introduction 
of the voucher. The welfare gain associated 
' “with the maximum: voucher is a relatively 
modest 0.5 percent of mean income, but as we 
will show, the distributional effects are more 
substantial. Welfare rises with the voucher un- 
til about 86 percent of the population attends 
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ative entry externalities come to dominate “i it imi: 
fare gains beyond this point (see footnote 21}¥: Vor 
as is manifest in rising average costs of privat to the pri: 
schools. The bottom panel of Figure 4 shovp; 
the tax rate at each voucher level. For 10% 
voucher levels, the cost of the voucher is mo% 
than offset by the reduction in public-sch®% 
costs resulting from students who are indu#:: 
by the voucher to choose- private school, 94 
the tax rate falls. Eventually, the tax rate ml" 
rise, however, since a sufficiently laf 
voucher covers the cost of education for Y} 
tually all students in all schools. 
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g 4 Figure 4 also shows that per-student normed 
‘Bact ievement, or equivalently, average future 
0 P i 2 nings of those who will work, rises and then 
declines with the voucher. The partition of stu- 
g lents that maximizes earnings (gross or net of 
Schooling costs) entails a strict hierarchy of 
of kec chools with stratification by ability but not by 
: 4 ome. This is implied by the complementar- 
oity of b and @ in our normed achievement func- 
d 2 pn, and its property that peer effects impact 
Wr-student earnings more the higher is a 
E school’s 0. Maximizing welfare differs from 
udent) Fmaximizing net earnings in our model because 
demand for student achievement depends on 
"Bhousehold income. By one interpretation, our 
Finodel ascribes a household consumption mo- 

A tive to the educational achievement of the 
shouschold’s child(ren) (see footnote 8). The 
urther partitioning of students caused by in- 
Screasing the voucher will tend to increase wel- 
Ee as we have seen, but it has less-clear 
effects on ability stratification and, hence, on 
eamings. One effect of an increased voucher 
i to increase competition for high-ability stu- 

Q dents, which causes the boundary loci between 
ficumbent private schools to become flatter 
Fas further discussed below). This decreases 
the “degree” of ability stratification. Coun- 
leracting this is the migration of students into 
the private sector and the finer partition of stu- 
pene, dents associated with entry of private schools. 
the latter effect dominates at low vouchers in 


F PRIVATE SCHOogA 
AND ACHIEVEMENT fOUr computations, and earnings rise initially 
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D Tax RATE With the voucher. The former effect comes to 
oe {dominate for moderately large vouchers, and 
rinses decline with further increases in the 


woucher. Hence, while welfare is maximized 
a voucher high enough that 86 percent of 
r. Apparently, 14 seh dents attend private schools, earnings max- 
ne to dominate im ization necessitates a substantially lower 
it (see footnote 2¥oucher that draws only 47 percent of students 
rage costs of privi® the private sector. 
| of Figure 4 shot 










Proof and detailed analysis of these points is in an 
d Epublished Appendix, which is available from the au- 
nts who are indu% TS upon request. A nascent related literature studies the 
private school, í moti al grouping of workers of varying skills into firms 
ly, the tax rate n we Michael Kremer and Eric Maskin, 1995). Every stu- 
sufficien tly late eat i i a school has his own output (achievement), while 
. gorkers in a firm have but one output. Hence, the results 
a for W É the literature on worker grouping do not immediately 
cnoois. 


$P into the problem of student grouping. 
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The aggregate effects on achievement and 
welfare summarized above are likely to be sen- 
sitive to the details of the specification of the 
achievement function. In our specification of 
normed achievement, Sains from peer ability are 
proportionate to own ability. A specification of 
normed achievement in which high-ability stu- 
dents gain even more from high peer quality 
than do low-ability students would yield greater 
aggregate gains from sorting students by ability, 
for example. As we noted earlier, there is little 
empirical evidence on this issue. 

Aggregate achievement is maximized by 
complete ability stratification in our Cobb- 
Douglas specification, as observed above. In our 
computational model, complete ability stratifi- 
cation increases (normed) aggregate achieve- 
ment by 4.6 percent over complete mixing of 
students.” One-third of this potential gain is 
achieved in the public/private equilibrium with- 
out a voucher. As we illustrated in the bottom 
panel of Figure 4, aggregate achievement is 
nonmonotonic in the amount of the voucher. 


? Normed aggregate achievement with complete mix- 
ing is given by 


An = K | exp| ~ (u + 02/2) |bf,(b) db 
o B 


- ren (1 


+ ae + 23/2) 


` 


where f,(b) is the lognormal density function. Normed 
aggregate achievement with complete ability stratrifica- 
tion is given by 


A, = KÍ bwO*! f(b) db 
o 


= Kep] (Z+ 1s + (z+ 1) 03/2] 


cow owe . a a a 


Hence, 


As Y{7 os 
— = —~|5+1)—] = 1.04 
x s| ( ) 2 | 1.046 


for our calibration. In steady state with complete mixing, 
one can show that g} is invariant to y/@. Hence, the pre- 
ceding formula can be used to determine how potential 
gains to ability stratification vary with y/@. Such gains 
increase with y/f (e.g., to about 45 percent for y = £). 
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Maximum achievement occurs at a voucher of 
$2,800, and this maximum is 50 percent of the 
potential achievement gain of moving from 
complete mixing to complete ability stratifica- 
tion. We emphasize that such aggregate gains 
can undoubtedly be made either larger or smaller 
by varying the relative benefit of peer quality to 
students of different ability Jevels._ 

The remaining figures illustrate the distri- 
butional effects, of the voucher. The effects of 
the voucher on achievement and welfare can 
be divided into the impacts on the four groups 
illustrated in the upper panel of Figure 5. Area 
A contains students who are in the public 
schools before and after the introduction of a 
$2,000 voucher. Areas B and C combined are 
students who switch from public to private 
school when the voucher is introduced. Area 
D contains students in the private-school sec- 
tor before and after the introduction of the 
voucher. 

The primary gains in achievement accrue to 
students who switch from the public- to the 
private-school sector with introduction of the 
voucher (areas B and C). They have achieve- 
ment gains ranging between 12.9 percent and 
20.2 percent. The major losses in achievement 
are experienced by students who remain in the 
public school after the voucher is introduced. 
They all experience a 4.9-percent loss. The lat- 
ter group is, of course, much larger than the 
former. 

The welfare effects (measured by com- 
pensating variation) are distributed some- 
what differently. Students who remain in the 
public sector (area A) all experience welfare 
losses. The quality of the school they attend 
has deteriorated, and tax changes are small 
as discussed above. Since public schools 
charge a price of zero, the voucher does not 
reduce the cost of education for public- 
school students. 

Paradoxically, some of those who switch 
from the public to the private sector (area B) 
are also made worse off. Their alternatives are 
adversely affected by the voucher. They can 
either stay in a public-school_system of re- 
duced quality, or they can pay tuition at private 
school. They choose the latter, but the voucher 
defrays only a portion of the cost. Thus, while 
they have large achievement gains, those gains 
are more than offset by the reduction in in- 


. Figure 2) to 398, 351, 326, and 308, respectively. II% 


































come net of tuition.” For each income leyf 
the largest loss within this group is sustaing 
by the students at the lower boundary of arg 
B. By boundary indifference, the magnitude gf 
loss for a student on this lower boundary is tyf. ~. 
same as for a comparable student on the Uppak: 
boundary of region A. Students on the boung 
ary between regions B and Ç neither gain ng} 
lose from the voucher. i 

The remaining two groups (C and D) gait 
from the voucher. The largest gains as a prof 
portion of income accrue to high-ability, low. 
income households. As the voucher increasef 
the demand for private education, it increase 
competition for high-ability students and th} 
financial aid they receive.*! The highest-abiliyt 
students experience a -tuition reduction that i} 
almost twice the amount of the voucher. Thi 
greatest gains are thus in the lower-right pork. 
tion of the upper panel of Figure 5. 

The bottom panel of Figure 5 illustrates fuf 
ther the distributional effects of the $2,00}: 
voucher. Compensating variation as a per} 
centage of income is plotted for four differen 
income levels as a function of ability. This fig 
ure demonstrates that the gains accrue to thos} 
with high ability, and among high-abilit} 
households, the gains are proportionatelt 
greater for low-income households. Losses art 
realized by households that remain in th 
public-school sector due to the decline it 
public-school peer quality induced by tht 
voucher. Again, those in area B in the uppi, 
panel are also losers. The proportionate lost 
for the latter two sets of households is low, bi 
they make up a majority of the population. ¢ 


3 
y 


3 This result is similar to Benabou’s (1996a) finditifi, 
that equilibrium segregation across communities (2>#} 
schools) by endowed human capital (ability) can Of g 
high-human-capital types more in housing-price pregi. Figure ( 
than they gain relative to an allocation with 
segregation. p 

3! For example, when the voucher is increased from! i: 
to $1,800, the 7’s in the top four schools rise from 3 R 5 
310, 282, and 261 (dollar discount per unit of ability; Sg: 


estingly, the increased competition for high-ability Y, 
dents actually reduces the quality of the top schools, é de 
entrants bid some of these students away, and the bovFpy 
ary loci between private schools become flatter. Comps and 
the 9’s of the top schools in the two panels of Figure), 

This phenomenon persists in the steady state. s 
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‘Figure 6 illustrates the distributional ef- 
ts of the voucher in steady state. For each 
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age change in expected steady-state income 
relative to the zero-voucher steady state. One 
would expect gains to accrue to the bulk of 


lity and a voucher level, we calculate the 
xpected steady-state income of a student’as 
Sllows. Given a student ability and knowing 
EC Steady-state distribution of parental in- 
we, we calculate the probability that a stu- 
peut will attend each of the available 
hools, Using this, each school’s quality, 
; ad the Student’s ability, we calculate ex- 
ed income. Figure 6 shows the percent- 












the relativery highest-ability students since 
the relatively highest-ability students are 
most likely to attend a higher-quality school 
as a result of the introduction of the voucher. 
Lower-ability students comprising approxi- 
mately 70 percent of the population are made 
worse off because they are likely either to 
remain in the public sector when the voucher 


er is incrcased frot 
ichools rise from 4 
per unit of abili 
)8, respectively. Inf 
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FIGURE 6. PERCENTAGE STEADY-STATE CHANGE IN EXPECTED INCOME FROM 
INTRODUCING VOUCHERS FOR STUDENTS OF DIFFERING ABILITY LEVELS 


is introduced (a public sector of diminished 
quality) or to enter a low-quality private 
school. The top 2—3 percent of the ability 
distribution (abilities 35 and higher) also 
have lower expected income because the 
very top schools that they will attend decline 
somewhat in quality (see footnote 30). 


IV. Concluding Remarks 


A recent Department of Education study 
reports that nearly half of all adult Ameri- 
cans read and write so poorly that they are 
unable to function effectively in the work- 
place, and not surprisingly, that many of 
them live in poverty (see Newsweek, 1993; 
Wall Street Journal, 1993). This depressing 
statistic, like many others before it, has led 
to calls for change in the U.S. educational 
system. There is no shortage of reform pro- 
posals-The research challenge is to develop 
models that provide systematic links con- 
necting preferences, the educational ‘‘pro- 
duction’’ process, costs, and institutional 
structure to consequences. Such models 
should develop testable predictions for val- 
idating or rejecting the elements of beliefs 


—— $1,000 Voucher 





Ability 


— $4,000 Voucher 





















E- Regarding 
our analysis 
E marized in tl 
É- These inclu: 
( Jicome and 
B Anegative c 
size and que 
all private : 
$ come for th 
K graphic m: 
that give rise to policy proposals, and thes. tuition on 
should facilitate systematic comparison ame, schools in e 
evaluation of policy alternatives. all private s 
One goal of our research is to provide ausf imizes ave 
ful foundation both in permitting formal andp shools, str 
ysis of educational policy issues and iif. tation by a! 
guiding empirical work. We are introducitf,“2A8 one 


d 


inputs into the model so that schools can cong, Wo 


e 


50 60 70 
99 Percentile Scale 


are also enriching the.-moọdel to contrast equg , tect 
librium in an open-enrollment system (as pi n 

the model in this paper) to a neighborho% £6) 
system in which students may only attend BR." W 
school in the geographic neighborhood ig: al 
which they reside. With Elizabeth Newlon, "= 
(Epple et al., 1997) have analyzed equilibriv4 Mill result 


when public schools adopt tracking by 2 gi and moven 
ity. These extensions are discussed pel iaa 
below. The model can be extended to allifi Public sec 

tome an 


for multiple peer characteristics. This is lik 
to lead to an equilibrium with a more divé 
set of private schools, diversity seen by P$ 
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* See Gamoran (1992) for an interesting empi 
analysis of tracking. E. 
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S depend also on the skills of colleagues is a key 
É step toward applying the model to higher ed- 
ication. In analyzing higher education, we 
Ealso intend to introduce a quasi-public sector 
"E(state schools) subsidized by tax dollars but 
Placing egalitarian dictates that constrain ad- 
‘BP mission and tuition policies. 
$ Regarding implications for empirical work, 
Pour analysis delivers several predictions sum- 
‘marized in the propositions we have presented. 
cale BThese include a negative correlation between 
| Eincome and ability within each private school, 
Fa negative correlation between private-school 
Kgize and quality, tuitions declining in ability in 
Pall private schools, tuitions increasing in in- 
‘come for the best private school in each geo- 
graphic market, and little dependence of 
posals, and tigtuition on income in the remaining private 
comparison agschools in each market, a scale of operation in 
tives. ‘Pall private schools lower than that which min- 
: to provide augglmizes average cost, a strict hierarchy of 
tting formal aumschools, stratification by income, and stratifi- 
y issues and | cation by ability. 
b are introduci As one of our referees stated, what we 
schools can co% Would like to know about a voucher program 
inputs as wells. “who gains, who loses, and how does it 
d Romano, 19%gadd up?” Earlier, we discussed aggregate ef- 
1 to contrast eects in detail. Here we discuss further the 
ent system (as @fcdistributional effects. In this paper, we 
b a neighbor consider an open-enrollment public system 
lay only attendi which public and private schools are 
neighborhood! equally effective in delivering education. 
rabeth Newlon,vUl model implies that a voucher program 
alyzed equilib: ie: l result in entry of new private schools 
; tracking by ae ad movement of students from the public-to 
, discussed bee private sector. Students remaining in the 
extended to alll public sector are those with relatively low 
stics. This is lif py ome and low ability, and those students 
gth a more div pr PeTience losses. Because vouchers in- 
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interesting empik: The Specific properties of the tuition functions are 
Oped in Epple and Romano (1993). 
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crease the premium on ability, the greatest 
proportionate gains from the voucher accrue 
to low-income, high-ability students. 

How sensitive are.our results likely to be to 
the assumption of a homogeneous open- 
enrollment public-school system? The conclu- 
sion that low-income, high-ability students 
experience large proportionate gains is likely 
to hold regardless of the organization of the 
public-school system. However, the distribu- 
tion of losses from a voucher program is likely 
to depend on the organization of public 
schools. For example, most public schools in 
the United States place students of differing 
abilities into different ‘‘tracks.’’ Applying the 
framework developed here to study tracking 
reveals the following (Epple et al., 1997). The 
largest proportionate losses from a voucher 
program are to students who remain in the 
high-ability public-school track when the 
voucher is introduced, because the voucher 
draws the most-able students from the high 
track into the private sector. In the presence of 
tracking, the voucher has very little impact on 
the low-income, low-ability students. Public 
schools consign these students to a low-ability 
peer group, and the voucher leads to modest 
decline in the quality of that peer group. The 
loss to such students from a voucher program 
is much smaller than the losses they experi- 
ence from introduction of public-school 
tracking. 

Consider now a neighborhood public- 
school system, without tracking, that assigns 
students to a school based on the neighbor- 
hood in which they reside. We have shown 
(Epple and Romano, 1995) that, even absent 
differences in expenditure across schools, 
Tiebout sorting will create a public-school hi- 
erarchy deriving from peer-group differences 
when income and student ability are posi- 
tively correlated. We anticipate that the intro- 
duction of a significant private sector, as 
supported by vouchers, would again benefit 
most poor, high-ability students. How poor, 
low-ability students would be affected rela- 
tive to the case of open-enrollment public 
schools is less clear. As with tracking, the lat- 
ter students begin with a weaker peer group. 
Thus, unfortunately, those who remain in 
public schoo] when vouchers covering part of 
educational costs are introduced may have 
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little to lose. On the other hand, their most- 
able peers will be the first to enter the private 
sector. This could imply even further losses 
to those who remain behind. We think this is 
an important issue for research, since neigh- 
borhood school systems continue to dominate 
in the United States. 
m . [t is often argued that large central-city 
public-school systems in the United States 
are ineffective in delivering education, and 
that the failings of these schools are visited 
primarily on disadvantaged students who re- 
side in central cities. Clearly, if vouchers 
lead private schools to supplant ineffective 
public schools: or inspire better performance 
from such ineffective schools, then a voucher 
program would lead to widely distributed 
gains. In the presence of public-school track- 
ing, for example, such gains, even if rela- 
tively modest, might be sufficient to offset the 
effects of diminished peer quality experi- 
enced by low-income, low-ability students. 
Such gains would need to be more substan- 
tial, however, to offset the losses experienced 
by students who remain in the high-ability 
public-school track after vouchers are 
introduced. 

Whether such gains in technical efficiency 
from voucher programs would be substantial 
is an open question (see footnote 4). Our pa- 
per stresses the allocative effects of vouchers 
and shows that vouchers could have significant 
distributional consequences. It indicates a 
need for continued effort to quantify the ef- 
fects of private-sector competition in educa- 
tion and to quantify the effects of school 
quality on students of differing ability levels. 
There are also implications for the design of 
voucher systems intended to promote alterna- 
tive goals. If vouchers are intended to improve 
technical efficiency without increasing ability 
segregation, then less-able students will need 
more financial assistance (or equivalent con- 
trols must be enacted). If ability segregation 
is to be further promoted to increase aggregate 
achievement, then vouchers will need to be 
income dependent, Hopefully, our framework 
will facilitate the investigation of such 
voucher-design issues, as well as the investi- 
gation of other proposals affecting the entry 
and exit of schools or student access to 
schools. 
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PROOF OF PROPOSITION 1: 

We show first that the public school has 
worst peer group. Suppose to the contrary 
ome students 
__ ting the same r. 
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ents, it is a Ic 
Any student who would have to pay a positi fmust drive this 
price to attend private school 1 would pre Ghigher-order d 
the free public school. Hence, private scho: proceeding. 
1 could not be profitable. 
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Showing that a strict hierarchy of priv 
schools characterizes equilibrium is considy 
ably more involved. The proof is by conti 
diction, so assume that 6; = 6; for some i +j{ 
i,j = 1, 2, ..., n. We show that this implies} 


A2) Talb, 


anne. 


(A1) pi(b, y) = p;(b, y) = MC; (b) 


Denote the ex 
script since tt 
R); 12 is ar 
with æa; (b, y) > 0 and/or a,;(b, y) > 0. ‘one obtains 


= MC,(b) V (b, y) 


We will go on to show by construction the. 
(A1) implies that school i (or j) can increas 
profits by admitting and expelling certain si 
dents, contradicting profit maximization. $ 

Condition (6a) implies p;(b, y) = ply 
y) V (b, y) since 6; = 6;. Condition (6b) ay 
market clearance imply that MC; (b) = pi(lt. 
y) = p;(b, y) = MC,(b) for students wit 
attend school j and, analogously, MC,(b)% 
pi(b, y) = p;(b, y) = MC; (b) for studet: 
who attend school i. The linearity of MC(: 
implies: (a) MC,(b) = MC;(b) Vb; ta 
MC; (b) < MC;(b) V b (or equivalently, W% 
reverse); or (c) MC; (b) > (=) (<) MCh 
as b > (=) (<) b', for some b’ (again, " 
versing i and j provides an equivalent). Cai ` 
(b) precludes school j from admitting oki i 
students and can be rejected. Case (c) PM. 
cludes ð; = @;, since school i would ayer" 
only student types with b = b’, and schod a 
would admit only student types with b 2) ~ 
This leaves case (a), which then imp 
(A1). i 





d ap L 88 NO. I 
| We now show that student types (b,, y; ) and 
Ep, y) exist, with a(b,, yı) E (0, 1] and 
; j (ba, Y2 e (0, 1], such that school i can in- 
: ease profits by admitting the same number 
school has ¢ BE (bi, yı) types as it expels of (52, ¥2) types. 
2 contrary ti, hool i will increase profits then by admitting 
B me students who initially attend j but expel- 
9, a g the same number of its own students. Note 
| ‘at, since tuitions equal effective marginal 
sts in school i for both such types of stu- 
_ dents, it is a local nonconcavity of profits that 
pay posit ist drive this result. Hence, we will consider 
l would pref higher-order derivatives. 
| private sch, Proceeding, we have 
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'(b,y) ŠDenote the expression (A2) with 7} [the sub- 


ficript since the first variation is for type (b;, 
®;)]; t} is analogous. Using (5c) and (5d), 
a(b, y) > 0. fone obtains 
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ar: 


A4 O EEE re Ste 
( ) a Ole; (bi, yf (br, yi) | Ila Cb, ya) (ba, ya) J 


LE TW") + 


wee 


Op* (Yi bi, 0i) (> = =) 
08; ki 


e Op; (y2, bə, 9;) (2 = *) 
06; ki 


(>=) ðp* 
- (| — | | a, fab dy 
ki kidt 86; 


(bi — 0) (b — aff Op; 
+ — al ~r a; f db dy. 
k; * 09? 


The expression for 75, is analogous to (A3). 
Let A; equal the change in the number (den- 
sity) of types (b;, y; ) enrolled in school i. For 
A,’s sufficiently small, Taylor’s theorem im- 
plies the sign of the change in school i’s profit, 
Az;, will be the same as the sign of 


(A5) mi Ay + wh As + iiri aA) 
+ W5( Ay)? + 2m }2A, Ao]. 


We know that r} = 74 = 0 since prices equal 
effective marginal costs. We will consider ad- 
mission changes such that A, = — A;. Hence, 
(A5) simplifies to 'A(A D imi + rh - 
272]. Substituting and simplifying yields 


(A6) sign Az; 


. 2 ô E „b Oo; 
= sen? (b — e Za 19.) 


= Op;* (Y2; by, 6;) 
are Ma sia 06; - 





-bK f ge? * 
4 (2) f a aif db ay}. 
Ss 


Suppose that y, is greater than y, but very 
close to it, and that b, is greater than b, but 
closer still to b}, formally, of lower-order 


“58 THE AMERICAN ECONOMIC REVIEW 


-difference. In a moment, we show that 
school i can substitute students this way. 
From (6a), 


dp* _ 9U/00, 
36,  ƏUlƏy, ` 








Lemma 5 in the Appendix to Epple and 
Romano (1993) shows that př (b, y, 0) is 
continuous in (b, y). By SCI and this lemma 
(keeping in mind that b, is extremely close to 
bz), the first term in (A6) is positive. More- 
over, it will dominate the second term due to 
the lower order of the difference (b, — b2) and, 
again, using the lemma. 

The result then follows so long as such a 
substitution of students by school i is feasible. 
Such a substitution is clearly feasible if there 
is a positive measure of type space over which 
both schools i and j admit students. Suppose, 
however, that no such overlap exits. School i 
accomplishes an equivalent substitution of stu- 
dents in what can be thought of as two steps. 
It first expels half of all of its students and 
admits half of all of the students who attend 
school j initially. Since 6; = 0; and MC; = 
MC;, this step has no effect on i’s profits or 
0;, k;, and n; (hence MCG; ).' Now the profit- 
increasing substitution is feasible, which con- 
stitutes the second step. In this case, the 
variation that increases profits is just more 
involved. 


PROOF OF PART (ii) OF PROPOSI- 
TION 3: 

Household choice is as though tuitions 
equal effective marginal cost. by part (iii) of 
Proposition 2; that is, type (b, y) chooses 
school i over j if and only if U(y, — MC; (b), 
a(6;,b)) = U(y, — MC,(b), a(6;, b)). Keep 
in mind our convention that MC (b) =0 Vb 
and let 7) = 0. Also, let p? = V'(k;) + imi, 
i= 1, ..., n, and pi = 0, so that the first ar- 
gument in the utility function is y, + nb — p°. 
Choice over schools by type (b, y) facing tu- 
ition MC, (b) is equivalent to choice over 

“schools by type (b, ye + n:b) facing type- 
independent tuitions pf. We map the school- 
choice problem into (8, p*)-space and show 
that a standard single-crossing argument im- 
plies SBA. 
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where U(-) has the same arguments as if olution present 
(A7). The first term in (A8) is nonnegati | described i in th 
since #(@) is nondecreasing, and the secon: their local max 
term is obviously positive. These indiffereny} gnti] the profit 
curves also have slopes that are strictly if minimized. On 
creasing in b for 0 > 4 and weakly increasix’ reducing the fi: 
at @ = ĝo. Differentiation of (A8) with resp putational diff 
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where b in the first argument of U(-) is h , Complex and 1 
constant in the latter term. The first termig:‘:The natura 
(AQ) is strictly (weakly) positive by SCI fo also €ncounte: 
6 > (=) 6, and the second two terms #f. Wat, in two sta 
weakly positive by #’ = 0 and W-SCB, *g:l enter (e.g, 
spectively. Given that slopes of indent ash game ir 
at the secon 


resolutions of it, and the related literature % mn ymous cr 
club theory. Consider a quasi-equilibrium $ Src Th 








a ocal profit maximization [i.e., (6a) —(6c)] 
Die all private schools. Proposition 1 would 


sing and diff 
tisfying 7(0,) i yntinue tO. apply, implying that the private 


y such functi" pols serve different niches of students. Un- 
it to ; Foss the distribution of types is ‘‘just so,” the 
ig fitability of serving the different niches will 

\0;)b ery, Each private school is maximizing prof- 
| zs locally, but the profit function (which the 
Werility-taking schools share) has multiple 
bd]. Fe aks (in the space of functions [p(b, y), a(b, 
)-space are #1), and only one school is at the global max- 
} fimum. The other private schools are at lower- 

Woiofit peaks and would have an incentive to 

MEimimic the tuition/admission policy of the 
‘Bhighest-profit school. This problem persists re- 

E sardless of the (finite) number of schools, so 

Wihat ZIT cannot generally be satisfied. 

Ba) (a/a8) Been meant problem is fundamental to 
(-)/Oy, ‘Melub economies (see below), and no easy res- 
arguments asMolution presents itself. The epsilon equilibrium 
$) is nonnegaifdescribed in the text keeps private schools at 
, and the sec@their local maxima and, roughly, allows entry 
‘hese indiffere@¥intil the profit peaks and their differences are 
at are strictly minimized. One could make e arbitrary low by 
weakly increas¥fjeducing the fixed costs, but this presents com- 
(A8) with resp§fpotational difficulties and sacrifices realism. 
_ {Related to this, one might also assume con- 
agitant costs of schooling. This would lead to an 
}(Ga/00) | , SBhfinite number of schools serving infinitely 
ay, iBrefined peer groups. We have made some pro- 
on ‘ggfess on such a specification in a special case 


ees 





-With the help of an anonymous referee. We 
a) (2100 | tieve exact equilibrium exists in such spec- 
{-)/Oy, | fications, but this has not been proved. While 


f i model is extremely interesting, it is quite 
ent of U(-) is beemplex and not yet tractable. 
_ The first terilfThe natural game-theoretic specification 
positive by SCIS encounters existence problems. Suppose 
ond two terms $al in two stages, private schools first commit 
0 and W-SCB,@ enter (e.g., pay fixed costs) and then play a 
pes of indifferge? h game in tuition functions. We believe 
A follows by aS the second-stage profit functions will con- 
rent. gate to be multipeaked, and Bertrand-liké 
EK Of pure-strategy equilibrium will arise. 
—Here we destigeotchmer (1985) shows existencé of equilib- 
problem, poteen In pure strategies in the case of clubs with 
related literatumy p Ymous crowding using a game-theoretic 
hasi-equilibrium g$ koach. The difference is that incumbent 
brium requirem@e.’S carn the same level of profits, a property 
ubstituting an e$ Ur model will not generally have, be- 


er of private sch! use the variety of types induces differenti- 
{ with a require 
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The problem we encounter is familiar to 
club economies with nonanonymous crowd- 
ing. Either economies of scale or indivisibility 


of club members generally leads to existence -- 


problems in such economies, both causes de- 
tailed in Scotchmer and Wooders (1987) and 
Scotchmer (1997). Our continuum of agent 
types also implies a continuum of each type, 
eliminating the latter existence problem as in 
Scotchmer (1986). But the first existence 
problem remains. In lieu of exact equilibria, 
epsilon-competitive equilibria are also studied 
in this literature, with the difference that it is 
club members rather than club owners who do 
not fully optimize. A correspondence between 
exact equilibrium and the exact core and be- 
tween epsilon equilibrium and an epsilon core 
is demonstrated in this literature. This nor- 
mative focus highlights consumer behavior 
rather than ‘‘firm’’ behavior, and consumers 
are the natural candidates for deviation from 
full optimization. Our positive focus provides 
a more salient role for firms (private-school 
owners), and they are more convenient can- 
didates for deviation than the infinity of house- 
holds. Hence, our epsilon equilibrium has 
households fully optimize but not private 
schools. 
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i È Who will cure the nation’s ills? 
fA leader with a selfless will. 


ars ete TORE But how will you find this leader of 
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> P Bt o E By a process of natural selection of 
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add, ed., Holi}: W. H. Auden (1940) 


/ashington, Dey: 
6, pp. 149-74 : ; 
ea B. and Th% Across diverse circumstances, there is often 
ear Report: lf jfegularity in the manner in which a person 
ogram.”’ immesponds to his or her environment. This ob- 
Madison, Dect stved unity to a person’s behavior is part of 
-ahat defines that person. In the political arena, 
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People differ in how they respond to their environment. Some individuals treat 
each situation as unique and tailor their behavior accordingly while others re- 
spond in the same manner regardless of the situation. My objective is to explore 
how social systems select from such a heterogeneous population. A class of simple 
: hierarchical systems is considered which encompasses some features of corpo- 
Æ rations and electoral systems. A selection process operates on this population 
| Barbara L. ‘fei which results in successful agents going on to compete against equally successful 
ice?” Ameri, agents for further advancement. I characterize the population dynamics and the 
type of agent that ultimately dominates. (JEL D00, D23, D72) 


some politicians routinely support the popular 
positions on key issues while others, often re- 
ferred to as ideologues, appear to embody a 
particular world view and support positions 
consistent with that view. The former are 
driven by the ‘‘here and now’’ of getting 
elected while the behavior of ideologues is 
rooted in those past events which determined 
their ideology and their commitment to that 
ideology.' Representative of such a contrast 
are Winston Churchill and Prime Minister 
Stanley Baldwin during the 1930’s (William 
Manchester, 1988 p. 219): 


Winston was guided by a built-in gyro- 
scope which would carry him toward his 
objective through tumult, while the 
prime minister relied on a kind of soci- 
ological radar—signals from voters—to 
determine his course. 


A related form of heterogeneity is mani- 
fested in organizations. There are those indi- 
viduals who strive to please their superior— 
commonly referred to as ‘‘yes men’’ —while 
others (let us call them mavericks) appear to 


-~spossess a-definite opinion as to what is the 


proper action, independent of their superior’s 
opinions, and are committed to acting on it. 
‘Yes men’’ are driven by the expedient 


' See Aaron Wildavsky (1965) for a comparative dis- 
cussion of ideologues and pragmatists and Gary M. 
Maranell (1970) for a ranking of U.S. presidents in terms 
of flexibility. 
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dictum ‘‘to get along, you go along,” while 
mavericks ‘‘march to the tune of a different 
drummer.’’ Turning to the ranks of business- 
men, one finds that some are quick to adjust 
their business strategy to new market condi- 
tions while others dogmatically cling to their 


_ original plan.? A classic example is Henry 


“Ford, whose stubborn commitment to the same 
strategy arguably caused Ford Motor Com- 
pany to squander its leadership. Alfred P. 
Sloan, CEO for General Motors, commented 
(1965 pp. 163, 437): 


Mr. Ford, who had so many brilliant in- 
sights in earlier years, seemed never to 
understand how completely the market 
had changed ... [He] stayed too long 
with his old and once dominantly suc- 
cessful concept of the business. 


An entrepreneur who was no stranger to 
change is Sam Walton (1992 p. 169): 


When folks buy into a way of doing 
things, and really believe it’s the best 
way, they develop a tendency to think 
that’s exactly the way things should al- 
ways be done. So I’ve made it my own 
personal mission to ensure that constant 
change is a vital part of the Wal-Mart 
culture itself. 


The commonalty to these diverse settings is 
that they all encompass people who differ in how 
they respond to a change in circumstances. Pol- 
iticians who follow the dictates of their constit- 
uents, members of an organization who ‘‘go 
along,” and entrepreneurs who routinely modify 
their business strategy are flexible in that their 
behavior is driven by the conditions of the en- 
vironment— whether it is to win an election, get 
promoted, or increase profit. In contrast, political 
ideologues, organizational mavericks, and dog- 
matic entrepreneurs are rigid in that they appear 
to be endowed with some fundamental approach 


? William D. Guth and Renato Tagiuri (1965) discuss 
how personal values might influence corporate strategy. 
At the level of business strategy, the trade-off between 
committing to an approach and committing to being flex- 
ible is the focus in Robert H. Hayes and Gary P. Pisano 
(1994). 
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and rarely veer from it. Their behavior is ñ 
tively unresponsive to the current shape of eats 
environment and is instead determined by ii 
events that established the properties of Aip 
code of conduct.? ; 

The central premise of this ‘study i is taf 
substantive source of heterogeneity ammi 
people is the degree to which they modify 
behavior in response to a change. in their gf 


a diverse population ‘within the confines of revel, in Elc 
government, corporation, or army. This leaf 
to a central research question: given a pop! 
lation of flexible and rigid agents and a hig} 
archical social system within which the 
interact and compete, which type of agent we 
rise to the top? Are ideologues or offi 

seekers more likely to occupy high-level d$: ere 
fices like governor, senator, and preside?’ Pi 
Are apparatchiks or mavericks more prota fi paper questi 
at climbing the corporate ladder and becom nig} tule survi' 


likely to have a business strategy built on pe 
sistence and dogma or ees and change 


Toward addressing these questions, a 
velop a simple class of hierarchical societ 


dicial realm as well, as was pointed out to me by K% 
Scharfman. A rigid judge may be interpreted as one ® A 
acts in ‘‘good faith” in that he administers his post’ 3 


he may personally disagree with the decision or know) Y Jis, 
it makes society worse off (Steven J. Burton, 1992 Fp ate 
36-37). In opposition is the expedient judge who fabs 
exercise self-restraint and lets personal interests and 4° 
erences, rather than overriding principles, iiuen ; 
decisions. i 

*See, for example, Abraham S. Luchins and $ 
Hirsch Luchins (1959), William A. Scott (1966):4 
David H. Jonassen and Barbara L. Grabowski (1993)§ 































jal system influence the characteristics of 
@ ose agents who do well within that system. 
ermined by Mi. research fits into a growing line of work 
operties of Wich uses an evolutionary-style framework 
is. explore the types of preferences or behav- 
study is thf: a] rules that survive in a competitive envi- 
geneity amg ment. This work is unique in modelling the 
hey modify t ¥ocial environment as hierarchical and in hav- 
ge. in their gee agents differ in terms of their flexibility. A 
a considera Similar form of heterogeneity is considered in 
documents fohn Haltiwanger and Michael Waldman 
ong a rigidiiiy1991), Robert W. Rosenthal (1993), and 
now place fale O. Stahl (1993) and, at a population 
he confines ditgyel, in Elchanan Ben-Porath et al. (1993). 
y. This lef ncompassing distinct forces, Ronald A. 
: given a po Heiner (1983) investigates the appropriate 
ents and a hÆgmount of flexibility. The flexibility of behav- 
in which t¥fjpral rules is also central to the time- 
ype of agentWonsistency problem in macroeconomics and 
gues or off¥lg the root of the debate on rules versus dis- 
y high-level ®eretion; see Finn E. Kydland and Edward C. 
and presidi Prescott (1977). From this perspective, this 
more proficigg paper questions whether agents endowed with 
er and becom rule survive at a higher or lower rate than 
ief executivet fihose endowed with discretion. 
epreneurs moti: 
egy built on pe, A Class of Hierarchical Competitive Systems 
ility and change. 
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ty more likelj 


i 





(@ | start with a hierarchical system with a low- 
ith the wind’(§ést level and no upper bound on the highest 
ill’? who wovel. At each level there is a large population 
rs before comet agents, specifically, a continuum of agents.° 

ipWhether it is a politician trying to advance 
questions, | @itom the state legislature to the House of Rep- 
archical sociegfesentatives or a regional manager striving to 
gents differ income a vice president in a corporation, ad- 
My research @yancement typically requires performing rel- 
the properties@i@ively better than some subset of peers who 
‘get also eligible for promotion. This process is 
' modelled by assuming that, at each level, 
ute tain Es are randomly matched into pairs and 
mpete for “‘promotion’’ to the next level. 
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The terms promotion, advancement, and sur- 
vival interchangeably are used. The assump- 
tion that an agent is compared to only one 
other agent is-a concession to tractability but 
would seem to be reasonable for electoral sys- 
tems where general elections typically involve 
two candidates. Each of these matchings is 
faced with a stochastic environment. Once the 
environment is determined and revealed to the 
agents, they choose actions. The agent with 
greater performance is promoted to the next 
level while the other agent is assumed to drop 
out of the system or, more to the point, no 
longer be eligible for promotion. While this 
“‘up-or-out’’ structure is extreme, it is not 
without merit. Casual observation suggests 
that a large percentage of candidates who lose 
do not run again and corporate employees who 
are ‘‘passed over” when their time has come 
may no longer be on the ‘‘fast track,’ which 
makes them less likely to be considered for 
promotion.’ 

There are two possible environments which 
I denote type O and type 1. At each level, a 
proportion b of all matchings have a type 1 
environment. This is assumed to be i.i.d. 
across levels so that the probability an agent 
faces a type 1 environment is b and this is in- 
dependent of his personal history. While each 
agent faces an uncertain future environment, 
the absence of aggregate uncertainty simplifies 
matters. Without loss of generality, I make a 
type 1 environment more common: b €E 
(1/2, 1). i 

In responding to one’s environment, there 
are two generic approaches. Depending on the 
context, they could correspond to a political 
ideology, a corporate ethic, a business strat- 
egy, or yet some other concept. In this simple 
setup, an approach or a strategy corresponds 


7 Joseph A. Schlesinger (1966, 1991) documents the 
progressive paths taken to higher office. The tournament- 
style structure of organizations is examined in Sherwin 
Rosen (1986), while Raaj K. Sah and Joseph E. Stiglitz 
(1991) also explore the determinants of upper-level 
management. 

8 The case of b = 1/2 tums out to be knife-edge. When 
b + 1/2, there is a finite number of rest points. When 
b = 1/2, there is a continuum of rest points and at most 
one of them is locally stable. Details are in Harrington 
(1994). 
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to a particular action to play in all environ- 
ments. By definition, action 0 (1) is the action 
associated with strategy 0 (1).° In a manner 
to be described momentarily, action 0 (1) is 
the best action for a type 0 (1) environment. 


Since b > 1/2, action 1 is more frequently the ° 


appropriate response to the environment. 

Selection is determined as foliows. If the 
two matched agents choose distinct actions, 
then the agent whose action matches the en- 
vironment survives and is promoted to the next 
level. If both agents select the action which 
matches the environment, the agent who has 
chosen that particular action more frequently 
in the preceding h rounds advances with prob- 
ability p € [1/2, 1]. If they have chosen that 
action equally frequently, then an agent is ran- 
domly selected to survive. There is no need to 
specify whom is promoted if both agents 
choose the less appropriate action as the set of 
equations which describe the population dy- 
namics is independent of it. 

The idea is that survival depends on current 
performance which itself is determined by 
one’s action and one’s proficiency with that 
action where proficiency comes from experi- 
ence. If p > 1/2, then experience yields an 
advantage which could be due to learning-by- 
doing or, as in the electoral context, credibility 
that comes from being relatively consistent in 
one’s positions over time. If p = 1/2, then 
there is no experiential advantage. p is a mea- 
sure of how much experience matters.'° Note 
that survival depends lexicographically on 
one’s current action and one’s experience with 
that action. This means that the incremental 
effect from choosing a better action exceeds 
the incremental effect from more experience. 
h is a parameter which determines how much 
of an agent’s history is relevant for profi- 
ciency. I initially consider the case of h = 
so that an agent is more effective in using a 


* Since there is a one-to-one mapping between strategy 
sets and action sets, there is no formal distinction between 
the two. Conceptually, there is a distinction in that a strat- 
egy is a rule which maps from the space of environments 
into the space of actions. Jt just so happens these strategies 
call for the same action for all environments. 

10 The results also extend to p < 1/2, but that range is 
inconsistent with the notion of experience contributing to 
proficiency. 
















particular action if he has chosen that act; 
more often over the entire history of play} esis also fel 
then examine the case when h = 1 so that on 

the most recent past matters. The value of 
could be determined by. the rate of depreg 
tion of knowledge regarding the proper use .+ 


an action or, in the electoral context, by mey sheing T 
ories of voters. Having advancersent dept. Consequen 
only on current performance is like the gf |, © he i 
Hollywood adage, ‘“You’re only as good g reliably 
your last picture,’’ or voters’ lament of “We _, 

have you done for me lately?’’ and obvious} ..: 

implies a certain myopia or forgetfulness, alt is then c 


for the corporate setting. In Section M, p$ eprtires. 
performance i is allowed to play a limited og sr 


rule. The space of agent types is then associat 
with the space of feasible behavioral rules. 


ronment. It is then the set of functions whith rigid eae e 
map the set of environments, {0, 1}, into% ible agents. T' 
set of actions, {0, 1}, with the exception th, the characte: 
the pathological case of always choosing anak evolves as it 
tion inappropriate for the current environmġ ¢hy.’* Thoug 
is excluded. I believe this simplifies the analy% three distinct 
without any loss of generality. A flexible age types after th 
is defined to be one who always selects theay.teract with th: 
tion best suited for the environment: he choot Performance 
action 0 (1) when the environment is type! behavioral re 
(1). A rigid agent chooses the same actionif.*0f present p 
respective of the environment. A type 0 rgg.tion the popu 
agent always uses action 0 and a type 1 ñ i 
agent always uses action 1. ig 
In concluding, there are a variety of so vg. Of agent s at ; 


of eterogenei 1 in rigidity. Faced with aco Sof fo 1S pr 
or all 











opinion as to which behavioral rule is b 
Alternatively, some agents may be endo¥ 


1! For the relation between an organism’s represent 
system and his action set, see Derek Bickerton (1990) ; 
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flowing 


Ets is also re 
= 1 so that jses 


. The value ġ a `, agents cannot decipher all of the com- 


argument for constrained choice 
Jevant (Heiner, 1983 p. 585): 









Jexity of the -decision problems they 
' face, which literally prevents them from 
electing most preferred alternatives. 
Consequently, the flexibility of behavior 
‘to react to information is constrained to 
smaller behavioral repertoires that can 


be reliably administered. 






























only as good 
lament of “Wj 


It is then quite plausible that people with 
ption for the e} Mifferent powers of reasoning, recognition, 
dly, uncompeliggnd memory would have different behavioral 


H. The Unbounded Memory System 


The initial population resides at the lowest 
vel of the system and is comprised of three 
the current en ziypes: rigid agents endowed with action 1, 
f functions whgiigid agents endowed with action 0, and flex- 
s, {0, 1}, into$ible agents. The objective is to understand how 
i e characteristics of this cohort of agents 
ys choosing anmevolves as it migrates up through the hierar- 

ent environmg¢hy.'* Though the system begins with only 
plifies the analy three distinct types, there are potentially many 
. A flexible aggitypes after the population has a chance to in- 
@eract with the environment because an agent’ s 
Performance depends not only on his endowed 


ironment is havioral rule but also his personal history. 
the same actioig£Or present purposes, it is sufficient to parti- 
nt. A type 0 itn the population as follows. 

and a type 1 


Definition: Let zi € {0, 1} denote the action 
of agent s at level z. For an h-memory system, 
ent s is proficient in action i at level ¢ if 
=iforallr € {max{1,t—h},..,t-1}. 


i 
a variety of sou 
Faced with 4 oi 
ind computati 
mply differ in. 
vioral rule is W 
s may be endof 
idify their beha 

nts can more 
rent environmd e 

. the difference A If the hierarchy is to be kept “full” then, at the end 
jannot conditio :®ch round, a fresh cohort of agents must enter the 
fst level to replace those who moved on. This is not 
ant to the current analysis in that I am interested in 
a pens to a single cohort. In Harrington (1996), 
7 aa new agents is modelled where a new agent’s 
ganism’ s represent oral tule is assumed to be partly endowed and partly 
k Bickerton (1990) Fesult of imitation of those at higher levels. 





proportion of the level t population that 
are rigid agents endowed with action i 
(Ri); i € {0, 1}. 
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fi = proportion of the level t population that 
are flexible agents who are proficient in 
action i (Fi); i € {0, 1}. 

= proportion of the level t population that 
are flexible agents who are not proficient 
‘in any action (FN). 


The level ¢ state of the system is defined by 
(ri, fi, ro fh, x’). It is assumed that r! > 
0, ri = 0, and1 —r!-r}>0. 

In this section, I consider the case of un- 
bounded memory (h = œ) so that an agent’s 
entire history matters. For t = 2, the state at 
level t + 1 is described by the following sys- 
tem of equations: ™ 


(1) rit! = 2ri[(1/2)ri + (b/2)fi + brh + 
bp l= Fy — fi— ro) 

(2) fit! = 2f[(b/2)r) + (b/2) fi + bro + 
On Lary fi- ro)h 

(3) rot! = 2rh[(1/2)ra + (C1 — b)/2)f5 + 
(l= b)ri + (1 - b)p(l-ri-rg- 


fol, 

(4) fot! = 2fo((1 — b)/2)r + (1 = 
b)/2)fo + (1 — b)ri + (1 — b)pal - 
ri —ro— fo)l, 


where I have substituted 1 — ri — fi — rg - 
fo for x'. In each of these equations, the brack- 
eted term is the probability that an agent of 
that type survives. Multiplying this by the 
level ź proportion of that type and doubling it 
(because an agent could be the first or second 
draw in a matching) gives a type’s proportion 
at the next level. Consider randomly matching 
two agents and suppose the first agent drawn 
is an R1. With probability ri he is matched 
with the same type, in which case his proba- 
bility of surviving is ',. With probability fi 
he is matched with an F1, in which case he 
survives with probability '4 when the environ- 
ment is type 1, which occurs with probability 
b, and does not survive when the environment 
is type 0 as an F'1 selects action 0. With prob- 
ability 75, he is matched with an RO and sur- 
vives when the environment is type 1. Finally, 
if he is matched with either an FO or an FN, 


'3 The system of equations which determines (r7, f}, 
r3, f, x°) is different from that in (1)~—(4) because 
agents do not have a history at level -1. - 
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which occurs with probability (1 — ri — 
fi — rọ), and the environment is type 1, he 
survives with probability p because of his 
greater experience with action 1. In a similar 
fashion, the other three equations can be 
explained. 

Theorem 1 establishes that a population 
with all Ris is globally stable when the ex- 


periential advantage is sufficiently great-and/ = 


or the type 1 environment is sufficiently com- 
mon. All proofs are in the Appendix. 


THEOREM 1: With unbounded memory (h = 
œ), if pb > 1/2 then lim,.. ri = 1. 


Regardless of their initial presence in the pop- 
ulation, systems with sufficiently many levels 
find their highest levels dominated by rigid 
agents who use the action that is more fre- 
quently the appropriate response to the envi- 
ronment. Rigid agents endowed with the less 
effective action and flexible agents are even- 
tually eliminated." 

Theorem 1 is best understood by examining 
the system’s dynamics. It is straightforward to 
show that the presence of agents who are pro- 
ficient in action 0 is always diminishing: 
Ari < 0 and Afi < 0, where Ar} = rit! — 
ri and Af; = fit! — fi. Turning to those 
agents who are proficient in action 1, one can 
use (1)~—(2) to derive: 


(5) Iri > Othen: Ari 2 Oas f; 
= [(2pb —-1)(1 - ri) 

+ 2b(1 — p)ro]/b(2p — 1). 
(6) Ifi > O then: Af, 2 Oasr} 


VIA 


[((2pb — 1) + 2b(1 — p)ro) 


(b(2p ~ 1) ] - fi. 


Though F'ls are eventually driven to extinc- 
tion, their presence is increasing when the pro- 


'4 Tt is shown in Harrington (1994) that (r,, A, To, fo» 
x) is a rest point if and only if (71, fi, ro, fo, x) E ((1, 0, 
0, 0, 0), (0, 0, 1, 0, 0), (0, 0, 0, 0, 1), (0, (2b — 1)/b, 0, 
0, (1 — b)/b)}. 
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portion of Rls is sufficiently small. Tho 
Ris are eventually driven to domination, 
are decreasing when the proportion of Fj,: 
sufficiently great. Note that when rj = 0 
p = 1), (5)—(6) depend only on (r}, f1) 
Figure 1 depicts the associated phase diagy, 


— 


The use of Figure 1 to explore the dynam 
is reasonable since ROs are monotonically qf 
~ creasing and simulations reveal that they raj. 


idly converge to zero. 


Suppose the system currently has many Rif 
and F'ls so that it is initially in region L Fy 
example, when p = 1 andr} = rù, it can. 
shown that the level 2 state of the system isi. 
region I. Let me describe the resulting dyna}. , 













ical path, an example of which is depicted -. . 


Figure 2. The large presence of F 1s means thf 






Ris are frequently meeting agents whoa. . 


equally experienced in action 1 but are sup}. .- 






rior by means of their flexibility. As a result; . 









the proportion of Ris is shrinking since th : 
survive at a rate of only b/2 (<1/2) in the ; 
encounters with F is. Due to the large preseng . 
of R1s in the population, the relative advantag ` 
of an F1 is not his proficiency in action 
(since that is matched by Ris) but rather k 
flexibility. Hence, F 1s tend to do best wh 


tt 


the environment is type 0 as then, by choosig, - 


action 0, they capitalize on R1s being locky 


into action 1. Though Fis are surviving ay 


high rate, they are losing proficiency in acti 


1. This results in the proportion of F 1s shri: 
ù Proportion o: 


ing and the proportion of FNs growing (s 
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Figure 2). Since the presence of R1s is dimi ferential adv 
ishing in region I and the presence of ROs& is being prof 
always diminishing, the presence of flexit; flexible. Thi 
agents is growing in region I. When the pti when the en: 





portion of F 1s becomes sufficiently small, 


i they choose 


system moves into region II. At that point at least p, ; 


population is favorable to R1s in that they, ation 1. W: 







frequently meeting agents who are not pm 
cient in action 1, in which case their s 


rate is at least pb (>1/2). From then ont% 


o ally they ste 
a S en their 


erential ; 


proportion of Ris grows. For this path, ji flexible, in 


population dynamics are nonmonotonic in Be 


the presence of flexible agents is higher a5: 


termediate levels than at low and high levä 


Ris and Fs are largely meeting agents j“ 
are not proficient in action 1. Hence, R15* 


Fis are thriving. Note, in particular, that 
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sufficiently small,§they choose action 1, survive with probability 
n I. At that poit $e least P, and maintain their proficiency in 
tion 1. With FIs and Ris growing, eventu- 
ally they start frequently meeting each other. 


its who are not fi 
in their frequent encounters with Ris, the 


ich case their sur 


‘2). From then lcrential advantage’df an F4 shifts to being 
ws. For this pa Xible, in which case their presence shrinks 
2 nonmonotonic if they convert to FNs. This is represented by 
| agents is higher 1E “system moving into region II. At that point, 
: low and high leg” Presence of Ris is sufficiently great that 
er when the syste $7 Continue to grow and that growth accel- 
to their small pres a8 the presence of F1s shrinks. An in- 


ly meeting agents S Asung property of this path is that the 
tion 1. Hence, RE vorton of Fis is growing at low levels 
, in particular, thlg 9 though they eventually become extinct. 
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While Ris can initially founder, they start 
thriving once the proportion of Fls is suffi- 
ciently small. What is surprising is how long 
this can take. Since a flexible agent must have 
faced T — 1 consecutive type 1 environments 
to be proficient in action 1 at level T, one 
would expect there to be very few proficient 
flexible agents when T is not small and b is 
not close to one since the likelihood of 
T — 1 consecutive type 1 environments is 
b7~'. It would then seem that Rls would 
quickly overwhelm Fis: What-this ignores,. ., 
however, is that those flexible agents who are 
lucky enough to be proficient in action 1 have, 
a much higher chance of surviving by virtue 
of their proficiency. For example, suppose (b, 
p,ri, rd) = (0.6, 1, 0.25, 0.25), as in Figure 
2. Of the proficient agents, those who are flex- 
ible are on the order of 1 in every 2 agents at 
level 10, 1 in every 3 at level 15, 1 in every 4 
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FIGURE 2. UNBOUNDED MEMORY SYSTEM: EXAMPLE 
(b = 0.6, p = 1, r! = 0.25, r} = 0.25) 


at level 20, and 1 in every 8 at level 25. By 
comparison, when selection is inoperative so 
that all agents survive, of all proficient agents 


those who are flexible are around 1 in every... 


100 at level 10, 1 in every 1,000 at level 15, 
1 in every 15,000 at level 20, and 1 in every 
200,000 at level 25. Thus, selection allows 
proficient flexible agents to survive for many 
more rounds. By maintaining proficiency, 
flexible agents may be able to limit the dom- 






























relatively many levels. ag the envir 
Now suppose experience with an action ;f:.4) the actio 

sufficiently unproductive in improving syf i: except W 

vival. Theorem 2 establishes that flexjpyf ;2° 

agents will dominate. | (j) one has 


- ea aa o a | (ii). the cur 
THEOREM 2: With unbounded memory (hẹ . and 
©), if pb = 1/2 then lim,... (ri + ro) =0. (iii) one is n 
| RITE 


Since, when the system is run long enough}... 
most flexible agents are not proficient in any} Compared to 
action, the long-run dominance of R1s is conf pest action f 
tingent upon them doing well against sud} mle enhance 
agents. Given that when an R1 meets sucha} chance of sv 
agent his probability of survival is pb, thet pility of that 
long-run survival requires pb > 1/2. Othe “i: 
wise, as shown in Theorem 2, flexible agent}: THEOREM 
will dominate. œ), if flexibi 
lim, 7 = 
Ill. Extensions of the Unbounded a 
Memory System f While this r 
| the time at ` 
The objective of this research is not to def ficiency, sv 
termine whether, in general, rigid behaviors} ingly, since 
prevalent but rather to understand how ths their flexibi 
properties of a social system relate to the typ} tion of Ris 
of behavior that thrives within that system for all initia 
Contrary to a common view that flexibilityi~ orem 3 in t! 
a superior trait, the analysis of the previouf ' A related 
section identified one class of social system: flexible age 
in which rigid agents do quite well. I now watt: identically 
to consider related systems so as to both asses: &gents use 
the robustness of this result and to understa¥f, chance of : 
what properties promote rigid behavior. Ty lime for w} 
simplify proofs, I assume p = 1 for the mf *** 
mainder of the analysis. H (8) Atlev 


Ree 


A. More Sophisticated Flexible Agents A > 


Whenever one engages in an analysis ty 
assumes agents are endowed with behavio 
rules, it is important to ask whether there ! 
behavioral rule not in the feasible set WE 
would have thrived. Along these lines, J f$ 
beyond strategies that condition only on ?@: 
current environment and consider makit® 
flexible agents a bit more sophisticated. 1%} 
preceding analysis suggests that a flexit 
agent, when faced with a rigid agent, mp 
want to mimic a rigid agent. This is embod 
in the following behavioral rule. 
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At level 1, choose the action that matches 
`; the environment. At level t (=2), choose 
th an action the action that matches the environment 


„except when: 


tos 


(i) one has always chosen action /; 
) the current environment is type j (+i); 


and 
) one is matched with an agent who is type 
Ri, i € {9, 1}. 


roficient in am ‘Compared to the rule of always choosing the 
of R1s is ú pest action for the current environment, this 
Il against irole enhances proficiency while maintaining a 
meets suchifchance of surviving today though the proba- 
val is pb, Èbility of that event is lower. 

> 1/2. 0 be 
, flexible age THEOREM 3: With unbounded memory (h = 
), if flexible agents use the rule in (7) then 


į lim, ~o ri = 1. 




















e time at which an flexible agent loses pro- 
ciency, such remains inevitable. Interest- 


well. I now waidentically proficient agent. If all flexible 
as to both assagagents use such a rule then one maintains a 
d to understithance of surviving today and lengthens the 
id behavior.igtime for which one is maximally proficient.'® 


the environment. At level t (=2), choose 
lexible Agents the action that matches the environment 
except when: 


"An agent who only knows the history of environ- 
ts and actions of the other agents may not be able to 
plement the rule in (7). For example, if another agent 
S always chosen action 1 and always faced a type 1 
i. 3a aG pia he re be either ys F TE The 
: ime, . .\/2 1S then a best-case scenario for flexible agents 
ts that a flew A fit presumes more information than they realistically 


ition only on} 
consider maw 


rigid agent, ME Neuld have, 
. This is emb of ae tule can be implemented with information only 
rule. ‘fey ther agent’s past actions. m 
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(i) one has always chosen action i; 
(ii) the current environment is type j (+i); 
and 
(iii) one is matched with an agent who has 
always chosen action i; i € {0, 1}. 


THEOREM 4: With unbounded memory (h = 
œ), if flexible agents use the rule in (8) then 
lim, (ri + fi) = 1. 


While flexible agents can now survive in the 
long run, this is only achieved by perfectly 
mimicking rigid agents! Since, in the limit, 
Fis face only other Fis and R1s, it follows 
from (8) that they choose action 1 regardless 
of the environment. Furthermore, if they 
ever consider deviating from this rule and 
choosing action 0 when the environment is 
type 0, it is known by the argument in the 
preceding section that they will eventually 
be driven out. 


B. Survival Depends on Past Performance 


Thus far, survival depends only on current 
performance which itself depends on an 
agent’s action and his experience with that 
action. In many settings, past performance 
would seem quite relevant. For example, 
promotion in a corporation is apt to depend 
on one’s performance throughout one’s ten- 
ure, though perhaps with disproportionate 
weight placed on recent results.'’ To encom- 
pass past performance, consider the follow- 
ing selection rule. 


(9) The agent who chose the best action for 
the current environment survives. 
If both agents chose the best action then 
the one who more frequently in the past 
chose the best action for the environment 
at that time survives. 
If both agents chose the best action and 
equally frequently in the past chose the 
best action for the environment at that 


17 A corporation might choose to focus on current per- 
formance under the presumption (perhaps false) that one’s 
past performance must have been reasonably good for the 
agent to have advanced to the current level. 
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time, then the one who chose the current 
action more frequently in the past 
survives. 


Otherwise, let me retain the specification in 
Section II. Survival depends lexicographi- 
cally on the current action, past perfor- 
mance, and experience. Given that flexible 
agents always choose the best action and al- 
ways have maximal past performance, en- 
compassing past performance results in 
flexible agents thriving. 


THEOREM 5: With unbounded memory (h = 
œ) and the selection rule in (9), lim,..(ri + 
rh) E ((2b — 1)rj/b(1 — r}), ri +74). 


Since lim,..(r, + ri) <r! + rj, then this 
social system favors flexible agents. However, 
rigid agents are not driven out. For example, 
when b = 0.75 and the initial population is 
equally divided between R1s, ROs, and flexible 
agents, rigid agents make up around 36 percent 
of the highest levels in systems with suffi- 
ciently many levels. 

To explain how rigid agents survive, let R1* 
denote a rigid agent endowed with action 1 
who has always faced a type 1 environment. 
The notable property of an R1* is that he has 
always chosen the action which is best for the 
environment, just like a flexible agent. By the 
selection criterion in (9), survival of rigid 
agents rests upon R1*s doing well. This may 
not seem very promising because it would ap- 
pear unlikely for any agent to continually face 
the same environment. Indeed, for the selec- 
tion criterion in Section H, one can show that 
the proportion of agents who are R1*s goes to 
zero as t-> ©, However, that selection criterion 
provided no pressure for Rls to accumulate 
such a history. In contrast, given the selection 
criterion in (9), Rls who just happen to face 
a type 1 environment in, say, the first two 
rounds, have a higher chance of surviving then 
Ris who have not. In particular, when faced 
with any type other than an F1, the survival 
rate of an R1* (as an R1*) is b (which exceeds 
1/2) in that he survives for sure when the en- 
vironment is type 1. For example, against a 
flexible agent who has not always chosen the 
same action, an R1* survives when the envi- 
ronment is type 1, as he has the advantage of 
























having always chosen the same action | 
third part in (9)] and, like any flexible ag 
has always chosen the action right for the a 
vironment including the current one. Agay a 


tories and choose identical actions. Given ty! 
long-run survival requires a survival rate of gow consider 
least 1/2, the long-run survival of R1*s (af Gyherwise ma 
thus rigid agents ) is then contingent upon thet: Ji (except foi 
not being too many Fls. What assures te} geularly inter 
there are not too many Fis is that the dynan of my earlier 
affecting F1s is exactly the same as that fy period memo 
R1* since the two agent types are identiyé;-determined s 
(and, as a result, the ratio of R1*s to Fls the preceding 
constant; see the proof of Theorem 5 in te ti 
Appendix). Therefore, Fis and R1*s shri 
together and eventually stabilize at levelg: proficient so: 
above zero. In particular, one can show t% RO, FO}. Th 
the proportion of agents who have alwag : 
faced a type 1 environment— R1 *s and Fs 
converges to (2b — 1)/b as t > œ with hy 
split between them depending on the leveli 
population.'® In short, while it is relatively wi 
likely for any specific agent to exclusively far 
a type 1 environment, some agents will ha 
done so and selection results in them beiy 
disproportionately represented in the next ger} 
eration. In this manner, the unlikely —a rig 
agent having always chosen the action bestfi. 
the environment—is made likely by selectitg: .- 
pressures and results in rigid agents survivilt§: 
I find that allowing selection to depend a} 
past performance is conducive to flexibtt: 
agents dominating though rigid agents sift : 
manage to survive. Finally, let me note thalif 
the third part of (9) were replaced with amp. : ~ 
dom selection of agents, there would be 8 J, 
virtue to being rigid. In that case, rigid aget}; 
would be driven out and only flexible age 
would be present.’ i 


(10) ori 


“ay ft 


18 Since, the proportion of R1*s is positive as t°% 
this follows immediately from (A10). 7 

1° If the third part in (9) was replaced with the flip’ 
a fair coin then AF = Fi [(2b — 1) — (Pi + 4], Wi 
ĵi denotes the proportion of R1* agents. Since fi? 
implies AF, < 0, it follows that lim,.. fi = 0. j 


















i Vv. The Bounded Memory System 


‘An implication of assuming the system has 
"bounded memory 1S that a flexible agent 
$ tho becomes less experienced than a rigid 
agent js always less experienced. Since such 
© assumption is clearly extreme and could be 
crucial for the domination of rigid agents, I 
sow consider a bounded memory system while 
‘herwise maintaining the structure of Section 





if my earlier results, the extreme case of a one- 


seriod memory is assumed. Proficiency is then 

determined solely by an agent’s behavior in 

he preceding round and, in principle, is réla- 

ely easy to achieve. 

$: With a one-period memory, all agents are 
proficient so that there are four types: { R1, F1, 

, FO}. The dynamical system for t = 2 is: 


rit! = ri{(1/2)r4 + (b/2) fi 
+ brh + bf], 


exclusively fa z 
agents will hig 

s in them bey 
d in the next 
nlikely—a 





) fit! = 2fiL(b/2)r) + (b/2) fi 





+ bro + bfo] 





+ 2folbro + (b/2)fol, 


ro) = 2rol(1 - b)ri 
+(1-—b)fi + (1/2)ro 


+ ((1 — 6)/2) fol, 


= 2ftf (1 = b)i + (1 — bY FI 
! Rae's coe) ae 
| + (C1 = b)/2) fi] 


+2 -b)ri 






10). É: 
eplaced with the Hf 


+ ((1 — b)/2) fi]. 
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In (11), 2f6[(b/2) fo + bro] represents those 
agents who switch from being an FO to being 
an F1, while in (13) 2f{((1 —b)ri + (i — 
b)/2) fi] captures the flow of FOs to F1s. In 
contrast, with unbounded memory, the only 
flow was from Fls and FOs to FNs. _ 
Though the bounded memory system does 
not have a global attractor, it does have as- 
ymptotic attractors. Let d((ri, fi, ro, fo), 
(ri, fi, ro, fo)) be the Euclidean distance 
between states (ri, fi, rô, fo) and (rj, fi, 


ro, fo): 


Definition: (r, fi, ro, fo) is an asymptotic at- 
tractor if there exists s > O such that if 
d((rj, fi rå, fo); (ri, fis To, fo)) <E then 
liM, r1, fis ro, fo) = (ro fis To, fo): 


A state is an asymptotic attractor if the system 
converges to it when it is in a sufficiently small 
neighborhood of it. This is then a kind of local - 
stability. 

It is shown below that the bounded memory 
system has two asymptotic attractors. Given 
the ease with which proficiency is achieved, 
one has all flexible agents. More interestingly 
is that the other has all rigid agents (endowed 
with action 1).”° 


THEOREM 6: With bounded memory (h = 
1), (ri; fis ro, fo) is an asymptotic attractor if 
and only if (Ti, fis ro, fo) E {(1, 0, 0, 0), (0, 
b,0,1—-—b)}. 


To explore the system’s dynamics, I con- 
struct a phase diagram under the assumption 
that rf = 0. As with unbounded memory, the 
proportion of ROs monotonically converges to 


2 A qualification to Theorem 6 is in order. By defini- 
tion, a rest point is not an asymptotic attractor if within 
any neighborhood around the rest point there are states 
such that the system does not converge to it. This leaves 
open the possibility that there could be other states within 
the neighborhood for which the system “does ‘converge. 
While this is not true for the rest point with all ROs, I have 
been unable to prove that there do not exist convergent 
paths for the rest point with a mix of flexible and rigid 
agents though, after an extensive numerical search, no 
convergent paths were found. Note, however, that if the 
limit is taken so as to make this a continuous time system, 
this rest point would be a saddlepoint which would imply 
that almost all paths would lead away from it. 
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FIGURE 3. BOUNDED MEMORY SYSTEM: PHASE DIAGRAM (ASSUMPTION: r = 0) 


zero and simulations show that the rate of con- 
vergence is rapid. Given ro = 0 and substitut- 
ing 1 — ri — fi for fo m (10)-(11), we 
derive: 


(14) Ifri > o0 then: Ari 2 0 as f; 
= [(2b — 1)/b] (1 — ri) 
=I\ri), 

(15) ffi > 0 then: Afi 2 Oas f} 

. = (60 -riy (A = bri) ] 
= Ari). 
T(-) is the same as when the system has un- 


bounded memory (assuming p = 1) while 
Q(-) is distinct. 



























i . ory does n 
; dynamics 
> qs sufficie: 


| agents cai 
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; at level 7 
> an agent j 
| tegain pr: 
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i to meeta 
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(16) (0) = b > (2b — 1)/b = T(0); 
F. flexible : 
P. ciency to 
F. - When 
A agents is 
$ lation dy 
E. the syste 
k though R 
$ ciently fe 


Q(1)=1=T(1). 
Q'(ri) = -[b(1 — ri)/(1 — bri)?] 


x[Q -b)+ (1 -brip 


<0 Yr; € [0, 1); Q'(1) = 0. 
T’ (ri) = -[(2b - 1)/b] <0 


Vri € [0, 1]. 


Using the properties in (16), the phase dg: 
gram is constructed in Figure 3. The two %% <; 
ymptotic attractors are represented by (/ 
fi) = (1, 0) and (1, fi) = (b, 0). (r, fi) 
((1 — b)/b, (2b — 1)?/b?) represents a ® 
point that is not asymptotically stable. : 











































t note that when the proportion of R1s 
ciently great, the population dynamics 
> qualitatively similar to when memory is 


F-n light of the qualitative distinction between 
3 bounded and bounded memory systems. 
en memory is unbounded, proficiency in 
the better action requires a flexible agent to 
have always faced a type 1 environment (and 
layed action 1). Flexible agents are then al- 
most surely less experienced than rigid agents 
when the system has sufficiently many levels. 
a the bounded memory system, a flexible 
agent who chooses action 1 becomes proficient 
Ein action 1 regardless of his preceding history. 
IB In spite of this difference, bounding the mem- 
F ory does not qualitatively affect the population 
amics as long as the initial presence of Ris 
sufficiently great. 
To explain this finding, recall that flexible 
gents can continuously gain and lose profi- 
jency in action 1 when memory is bounded. 
n FI faced with a type 0 environment at 
level t will become an FO (due to having cho- 
n action 0) but switches back to being an F1 
t level ¢ + 1 if the environment at that time 
type 1. The difficulty, however, lies in that 
an agent is unlikely to survive long enough to 
Rfegain proficiency in action 1. With a strong 
E presence of R1s, a flexible agent is very likely 
(HBto meet an R1 who will take advantage of this 
AW flexible agent’s ‘‘temporary’’ lack of profi- 
)/b = 1(0); 
Agate even when it only takes one period for a 
flexible agent to achieve comparable profi- 
Giency to that of a rigid agent. 
When instead the initial proportion of rigid 
agents is small, Figure 3 shows that the pop- 
ation dynamics are very different from when 
€ system’s memory is unbounded. Even 
hough Ris can grow (when there are suffi- 
ently few F 1s), eventually flexible agents 
‘Ominate as the system converges to (ri, fi, 
fo. fo) = (0, b, 0, 1 — b). To understand the 


BS g 
iy 
M1). Ae E 


1 — bri)?) 


(1 = b(ri)’ 





52°C) =0. 
1)/b] <0 


Ij. o “a 


4g Ut understand how Rls are driven out. As 
5), the phase 


feescribed above, the key to Rls surviving is 
ire 3. The two: flexible agents who lose their proficiency 
presented by (ig, *CHon 1 have a difficult time regaining it. 
(b, 0). (r, fi #,°Wever, when there are sufficiently few Rls 
| the Population, a flexible agent who loses 


|) represents a aye lation, 
ally stable. ciency in action 1 has a reasonable chance 


pas 


ar 
aa 


abounded (see Figure 1). This is interesting . 


iency in action 1. Rigid agents can then dom- . 


ng-run domination of flexible agents, one . 
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FIGURE 4. BOUNDED Memory SYSTEM: 
BASINS OF ATTRACTION 


of avoiding an R1. This gives a flexible agent 
the opportunity to successfully regain profi- 
ciency in action 1. In particular, this is 
achieved by an FO meeting his own kind. With 
this steady replenishment, the proportion of 
F ls is consistently high when there are ini- 
tially few Rls. By facing a perpetually inhos- 
pitable population, Ris are driven out so that 
flexible agents flourish. 

To characterize the basins of attraction of 
the two asymptotic attractors, numerical meth- 
ods were used. In contrast to Figure 3, I allow 
ro + 0. The basins are defined over the space 
of initial populations where an initial popula- 
tion is represented by (ri, rå). Simulations 
showed that the basins partitioned the initial 
state space. These results are shown in Figure 
4 where all points between a jagged line 
(where the associated value for b is noted) and 
the diagonal comprise the basin for the as- 
ymptotic attractor with all rigid agents, and all 
points to the left of á jagged line comprise the 
basin for the asymptotic attractor with all flex- 
ible agents. Note that if the initial population 
is biased towards flexible agents then this bias 
is magnified so that eventually nedrly every 
surviving agent is flexible. When instead the 
initial population is biased towards rigid 
agents then the system converges to the as- 
ymptotic attractor with all rigid agents. As one 
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increases the frequency with which action 1 is 
the appropriate response to the environment, 
the basin for the rest point with all rigid agents 
expands. If action 1 is made more effective 
then it is more likely that agents who exclu- 
sively use that action will dominate. This re- 
sult can also be interpreted as follows. As b 
gets closer to 1/2, the environment becomes 


more volatile in which case being flexible is ` 


relatively more, advantageous. Hence, flexible 
agents are more likely to dominate in that case. 


V. Concluding Remarks 


Environments routinely change and with 
them changes what is required to be well 
adapted. The extent of behavioral response to 
an altered set of surroundings is a crucial form 
of variation among organisms in the animal 
kingdom and would seem central to the deter- 
mination of which organisms thrive and which 
perish. In spite of the importance of behavioral 
plasticity in biological systems, its implica- 
tions for social systems have generally gone 
unexplored. 

In investigating this issue, I assumed 
flexible agents—by choosing the action best 
suited for the current environment—maximize 
the probability of surviving today, while rigid 
agents — by always choosing the same action— 
become relatively proficient with a particular 
action. A key assumption is that an agent’s 
current performance (where current survival 
depends on relative performance) is deter- 
mined primarily by his current action and sec- 
ondarily by his proficiency with that action. A 
central finding is that rigid agents can thrive 
even when experience and the proficiency it 
generates counts much less in determining cur- 
rent survival than doing what is right for the 
moment. It was also found, unsurprisingly, 
that flexible agents do better when proficiency 
requires less experience. More interestingly, if 
there are initially enough rigid agents present, 
rigid agents will dominate even when profi- 
ciency is achieved with a trivial amount of ex- 
perience. While having current performance 
be more heavily influenced by one’s action 
than one’s proficiency biases the model 
against rigid agents, a feature of the model that 
favors rigid agents is having an agent’s current 
survival depend only on current and not past 


‘advance and the remainder drop out and 













performance because, over all types of ey 
ronments, flexible agents generate higher 
erage performance in that they always cho 
the best action—and that is what primarily 
termines current performance. Relatedly, 
other feature of the model that would seen, 
favor rigid agents is the ‘‘up-or-out”’ struc 
in which agents who perform relatively b 


longer compete. If an agent who fails to 
promoted is not immediately kicked out of 
system, the higher average performance 
flexible agents may result in them doing be 
Indulging in some speculation, this sugg 
that rigid agents might do relatively better; 
electoral systems than in corporations in ie subpop 
the former are characterized by an up-orgk T performing 
























rent than past performance. It is this type t anis 
insight that I am striving for and which We “ay 


system relate to the behavioral types that thri hey detrim 
in such systems. t- form bette 
In light of the simplicity of our model, them in h 


is natural to wonder about the robustness tf gues, for 
our results. To begin, one needs to propery’ f 
pose this issue. If the model were to be ge į 


many actions, and many behavioral types,E that has t 
certainly would not expect purely rigig f: presumed 
agents to thrive. They are an extreme ty: fectly dey 
that perhaps only does well in the extrem% vious th 
model. The pertinent issue is instead whetht, j actions a 
relatively rigid agents do well. Robustnesg: igenis th 
then pertains to whether the identified fore: :: The ot 
would still be operative in a more geneng; | search w 
setting and thereby contribute to the deté : J ‘tnulate ir 
mination of which type prevails. I suspé ie selection 


that consistency of action would continue bg; ‘of Agents 
contribute to rigid agents surviving, the 
process by which flexible agents lose pro 


ciency would continue to result in note orld sy 


describe 
curve for achieving proficiency would oot 
tinue to promote flexible agents. d 

While having more environments and #4 
tions might seem opportune for fiexib 
agents, let me put forth some speculative 
guments as to why this might not necessa 
be so. Suppose there are n environments% 
actions, and each action is best for some #f 
vironment. In that they always use the s#§ 
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* on, there'are n types of fully rigid agents. 

a fully flexible agent using a different 
ion for each environment, there are n ! types 
+f fully flexible agents. Let me further suppose 
al m actions are highly detrimental in any 
‘environment other than the one for which they 
‘are best while the other n—m actions do rea- 
sonably well in all environments. If each pos- 
sible rule is initially equally represented then 
E-a proportion (n—m)/n of purely rigid agents 
MES; voids those actions that are highly detrimen- 
ME jal in most environments while only a propor- 
tion L/n(n — 1) (n — m + 1) of fully 
xible- agents do so. Even if the best rule is 
e that is fully flexible, one could imagine 
the subpopulation of purely rigid agents out- 
performing, at least for a time, the subpopu- 
lation of fully flexible agents. Another factor 


(i 
are detrimental in certain situations may per- 
orm better then a flexible agent who includes 
them in his repertoire. As Heiner (1983) ar- 
s, for a bigger behavioral repertoire to be 
dvantageous, an agent must be more skillful 
al recognizing the environment for which an 
ction is appropriate. This is a consideration 
that has thus far been ignored in that I have 
presumed that a flexible agent is able to per- 
ectly deploy his strategy. It is then not so ob- 
ious that more environments and more 
ons are necessarily conducive to flexible 
nts thriving. 
The objective of this particular piece of re- 
Search was a modest one—to begin to accu- 
Mulate insight into how a class of competitive 
ý hselection processes selects among a population 
ould continuĘ ðf agents who differ in terms of their rigidity. 
surviving, 4 ; Ahe ultimate goal of this line of research is to 
€nerate predictive statements about real- 
orld systems. One class of statements would 





in the ext 
instead whe 
ell. Robus 
identified fo: 



















. quicker lea : 
ency would cMWithin a hierarchical social system. For ex- 
gents. aemple, should more ‘tyes men” be expected 
ronments and 3$ be found in middle or upper management? 
lune for fie contrast, statements across social systems 
ücem how the overall rigidity of a social 
ystem varies with such characteristics as the 
umber of levels and the volatility of the en- 
, ament, For example, do ‘‘yes men’’ have 


rays use the Smaller presence in flatter organizations? 


escribe how rigidity varies across levels _ 
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While the model of this paper is far too simple 
to be a reasonable representation of such a 
complex system as an electoral system or a 
corporation, the hope is that later generations 
of models will be. : 


APPENDIX 


PROOF OF THEOREM 1: 

The proof is comprised of three steps. First, 
as £ > œ the proportion of F 1s is shown to go 
to zero (Lemma A1). Lemma A1 is then used 
to show that the proportion of Rls is not 
bounded..below 1 (Lémma A2). It follows 
from Lemmas A1 and A2 that T can be chosen 
so as to make fi arbitrarily close to zero 
Vt = T and rj arbitrarily close to one. From 
this result, it is argued that ri remains arbi- 
trarily close to one Vt > T. 


LEMMA AI: lims fi = 0. 


PROOF: 

Using (1)-—(2), it is straightforward to 
show that b < 1 implies that fi/rį is strictly 
decreasing int. Since fi/r\ is strictly decreas- 
ing and fi/ri is bounded below by 0, lim,... 
fi/r{ exists and is nonnegative. It follows 
that: 


(Al) lim[ (fit / rit) Fi rid] = 0. 


Using (1)-(2), it is straightforward to show 
that (A1) is equivalent to: 


(A2) 

lim <ii. 

imari + bfi + 2brh + 2pb(1 -ri -fi — ri) 
=0. 


Given the denominator is bounded below one 
and b < 1, it follows from (A2) that lim,... 


fi; = 0. 


LEMMA A2: If pb > 1/2 then there does not 
exist F < 1 such that ri < rforall t. 
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PROOF: 
Suppose not so that 37 < 1 such that ri < 
F Vt. Using (1), it is derived that: 


(A3) Ari=ri"' -ri 
~ = ri[(2pb — 1)(1 ~ ri) 
— b(2p - 1) fi 


+ 2b(1 — p)ro].- 


By Lemma Al, lim, — b(2p_ =i)fi= 

By supposition, ri < 7rVt which implies, with 
pb > 1/2, that (2pb — 1)(1 — ri) > (2pb — 
1)(1 — F) > 0 Vt. From these two properties, 
it follows that 3 finite T such that Ari > 


(2pb — 1)(1 — F) Yt = T. But if ri is in- 


creasing at a rate bounded above zero for all 
t = T then lim,... ry = +œ which is not pos- 
sible since, by construction, ri € [0, 1] Vz. 
Given that this contradiction was derived un- 
der the supposition that Lemma A2 is false, it 
is concluded that Lemma A2 is true. 


With the aid of Lemmas A1 and A2, I will 
now prove Theorem 1 by showing that: Ye > 
0 J finite T such that ri € (1 — e, 1) Yt = T. 
By Lemma Al, 3 finite t’ such that fi < 
[(2pb — 1)/b(2p — 1)](e/2) Vt = t’, where 
e > 0 (note that pb > 1/2 implies [(2pb — 
1)/b(2p — 1)] > 0). It is an implication of 
Lemma A2 that V finite t, 3 finite T = t such 
that rf > 1 — (e/2). Setting t = t’, I conclude 
_ that 3 finite T such that fi < [(2pb — 1)/ 
b(2p — 1)](e/2) Vt = T and rf > 1 — (e/ 
2). The next step is to derive sufficient con- 
ditions for Ar į > 0. Since ri € (0, 1 — [b(2p — 
1)/(2pb — 1)] fi) implies Ar; > 0 and given 
fi < [(2pb — 1)/b(2p — 1))(e/2) 
Vt = T, I conclude that if ri € (0, 1 — (e/ 
2)) then Ari > 0 Yt = T. Next, a lower bound 
on Ari is derived. From (A3) and p = 
1/2b > 1/2, Ari > —b(2p — 1)ri fi > — 
bfi. Ift = T then -bfi > —b[(2pb — 1)/ 
b(2p.—.1)](¢/2) = - (2b =.1)(e/2) > —e/ 
2. Therefore, Ari > —e/2 Vt = T. 

To summarize, I have shown that Ve > 03 
finite T such that: (i) rf > 1 — (e/2); (ii) if 
ri, € (0, 1 — (e/2)) then Ari > 0 Yt = T; 
and (iii) Ari > —e/2 Vt = T. By (i) and 
(iii), 77+! > 1 — s. Now suppose that t > T 


















andr; > l~e. Frie (1 — es, 1 — (ey 
then, by (ii), Ar; > 0, which implies r{*!\§ 
ri and therefore rit) > 1 — e. If ri > 18 
(e/2) then, by (iii), rit! > ri — (8/2) a$ 
therefore r{*' > 1 — e. I have then shown tyf 

r{*'> 1 —eandifr) > 1 —e then rit iia. 
1 — &,t = T + 1. By induction, it is concludgk- 
that ri > 1 — e Yt > T. Since € was arbitran} 
this proves Theorem 1. : 


PROOF OF THEOREM 2: : Í ion best fi 
Using (1) and (3), after a few steps one caf haye not. 
derive: Bir 


7 = prop: 


(A4) Ar} + Arg hee ee aTe r 
r and 

=(1-ri-ro)[(2pb-1)ri fs envi 

| Fi. prop 

+ (2p(1 — b) — 1)r]. Da arel 

and 

Since p = 1/2b then Ari + Aro < 0. Givat. (#i) 
that ri + rb is monotonically decreasing an f 
has a lower bound of zero, lim,.. Ari +! Note that) 


Ari = 0. Since (1 — ri — ro) is bounds} derive the 
above zero Vt, it follows from ( A4) tha} 
lim,.. Ari + Aro = 0 iff lim... ri = 0 amp (A9) Fi 
lim,. 76 = 0. i 


PROOF OF THEOREM 3: 

It is straightforward to show that: 
-fi k, 
= fil (2b - D0- ri) -fil f 


(AS). Bri ssi =a = + 


(A6) Afi 
By the same method used to prove Lemma Alg , 
one can establish that lim,.... fi | = 0. By (AMR. “or. 


ri is monotonically increasing. Since ri ] f 
ch 


implies lim, Ari = 0. Since lim, + fi” 
and r į is increasing then lim,- ri = 1. 


PROOF OF THEOREM 4: _ 
It is straightforward to show that: 


(A7) Ar} =r{(2b-1)Q1 - ri -fi) 


(A8) Afi =fi(2b-Id—-ri-fie§ 













ei + fi are monotonically increasing 
z pounded above by one then ri + fi has 
A E simit so that lim,..(Ar; + Afi) = 0. Since 
Wee + afi = (22 - DG + fA -ri 
then lim,-.(Ari + Afi) = 0 implies 
a(t f= — 


ing t 
LENAT 1 
a4 
3 


ROOF OF THEOREM 5: 
The first step is to partition rigid agents 
to those who have always chosen the ac- 


Wion best for the environment and those who 
have not. 


s A = proportion of the level ż population that 

; are rigid agents endowed with action i 

iE" and who have always faced a type i 
(2pb — 1)ri$. environment. 

uy] = proportion of the level r population that 

-1)ro]. % are rigid agents endowed with action i 

ve previously faced a type j 

Ar; < 0. Gig! and who have p y faced a type j 

y decreasing $ 


(+i) environment. 
> lim» AriffNote that r; = F; + ^t. It is straightforward to 
'ro) is bouti derive the equations of motion for t = 2: 
from (A4) i: 
Mie ri = 0R(A9) Fit! = FiL2bCF + Fo) 


















í 
F. +2(1-b)A +F! 
E P 

j E = gt 2 

w that: i ELE bñ), 

1= ri = fiPECAI) it! = M [2b — bô — bfi], 
a 

a-r- fig 
SRAL) fit! =fi[2b — bñ — bfi], 


prove Lemma i. 
fi = 0. By (Ai 
sing. Since ri§(A12) 


pee am OT rel2(1 — br, + Ai) 


i ee a es a 
e ae + 2bôh + Fh] + b(A), 
moti = 1 W 


WAB) Ai ARa — by -— (1 — by A 


4: | 
10w that: rt a l 
ha E - (1 — fs], 






i-ri- t) a 
é £ 14) fit! = f2 — b) — (1 — b) 


E =t bfi: 


a-ri -fi) 





HARRINGTON: FLEXIBLE AND RIGID AGENTS l ‘ -79 


LEMMA A3: lim,.. ro = 0. 


PROOF: 
Since Afo = fol — (2b — 1) —- (1 — b) Fo — 


- ¢1 — b) fo] <0, itis easy to show that lim,... 


fo = 0. Using this fact then a little manipula- 
tion shows: lim,... AF} = lim,..—Fo{1 — rb — 
2(1 — b)(Fi + ĵi)] = 0. From this it is con- 
cluded that lim,... Fọ = 0 and thus lim,.. 
ro =0. 


LEMMA A4: lim,- fi = [(2b — 1)/b] 
[rv -ril=p 


PROOF: 

Since f{*! = #2 JJL, [2b — bf] — bff] and 

fit! = fi Tie. [2b — bfi — bfi] then 
fit fit! = Fi/f?. It is straightforward to de- 
rive: 72 = bri(1 + rj) and fi =b(1-r!-— 
r})(i + r$). It is then obtained that f\/f{ = 
ri/(l—rj—ro) Vt = 3. Substituting [(1 — 
r|—r4)/rj JF for fi in (A10), the following 
is derived: 
(A15) #t'=A[2b-b((1—rdb)/ri)r]. 
Using (A15), it is easy to show that Af, 2 0 
as fi = p. If ĵi has a limit, it must then be A. 
ĵi converges to ô if |F\*' — pl < |A —- Al. 
This will only be shown for fi < #, as the 
proof is analogous for fi > p. Given fi < p 
and therefore 7\*' > ĵi, if fit! < ĝ then I am 
done. Suppose instead that 7{*' > ĝ. Using 
(A15) and substituting ((2b — 1)/p) for 
b((1 — r})/r}), one derives that |7{*! — 
ô| < |f — p| if and only if: 


(A16) (#1) = 2p —- (2b + 1)F; 
+ (2b -1)(1/p)(F\)? > 0. 


Since y'(p) < 0, and y"(-) > 0 VF; then 
(Fi) > pC) VFL < p. Given (Ô) =0, it 
can be concluded that (A16) is true. Hence, 
lim,o fi = ÔÊ. 


LEMMA A5: lim,» Fi = (1/2){1 - 2(1 
b)p — [1 -4(1 — b) ô — 4b(1 — b)p7]'"} 
7. 


Io: 
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PROOF: 
Since lim,.. rg = 0 and lim,... fi = Ô, it 
follows that: 


(A17)  lim| (077 '/0F1) 


— [2(1 — b)p + 271] | =0. 


Thus, as t > ©, (@7(7'/0r,) < 1 iff ri < 
(1/2) — (1 — b)p. It follows that if ri < 
(1/2) — (1 — b)p as t> œ then Fi converges. 
Furthermore, it can be shown that, as tf > ©, 
Ar 2 0 as lim,.... 7; = p. Hence, if F con- 
verges it must converge to p. Given that p < 
[(1/2) — (1 — b) ô] then, as t> œ, Ari <0 
when Fi > (1/2)M[1 —2(1 - bp). Therefore, 
4 T such that Vt = T if Fi > (1/2)f1 —- 
2(1 — b)p] then Ar < 0. This implies 3 T’ 
> T such that r7 < [(1/2) — (1 — b)ĝl. 

Since then |(O7F;7'/Or,)| < 1 Vt = T’, it 
follows that ri converges to p. 


I now want to show that r} +r} > f+). 
Substituting (1 — r4)(b/(2b — 1)]@ for ri 
and using the definition of p, rj + rg >—p+ 
P is equivalent to: 

(A18) (1—r})[b/(2b-1)]6 +r} 
>pt(1/2) {1-201 —b)p 
-{1-4(1—b)p 
—4b(1—b)p?]"""}. 
The left-hand side can be rearranged to: [b/ 
(2b — 1)]— + [1 — (b/(2b — 1))plro; and 
it is straightforward to show that [1 — (b/ 
(2b — 1))ô]ri > 0. Hence, (A18) holds if: 
(A19) [b/(2b-—1)]p 
> p+(1/2){1-201 —b)p 
—[1-4(1 = bp 
— 4b(1 — Yl el j 
Working through a few algebraic steps shows 


that (A19) is equivalent to: (2b — 1)/b > ĝÎ, 
which is indeed true. 


; Simplifying, (rf, 90,1 —- 


PROOF OF THEOREM 6: ae 
Since an asymptotic attractor must be a y 

point, the set of rest points is first derived gf 

then their asymptotic stability is evaluated, i 





PROPOSITION Al: (ri; fi, ro, fo) isan ot ice an ee geal 
point if and only ifr fis Poo fo) € (C Off Solving (23 
0), (0, 0, 1, 0), (0, b, 0, 1 — b), (1 = by 


1)/b)?< 
(2b — 1)7/b?, 0, (1 — b)(2b — 1)7b*)¥° Oi for w 


2b — 1) 
PROOF: | b/b ( 


The proof is by construction. Substitutiyt ; To prove tha 


1 -ri — fi — fo for rg in (10), (11), au} iotic attractor, 
(13), the dynamical system can be representy } =rhf (2b — 1)( 


by: pri e (0, 1 


(A20) Ari =ri[(2b— DA- ri- oft arene 
(1 - b)(2b — 
tractor, it will | 
(A21) Afi =fil(2b — 1) — bri — bfi) Ẹ hood of this t 
í; points such thz 
+ fo[2b(1 — ri) ; to this rest poin 

k that: ri>(l- 

— 2bfi — bfo], 

EE (a25) [2 

(A22) Afo=fol—b + (1 — d)ri ] > 
EN -Ê Note that the « 

ý equal (2b — 

+ fi[2Q — b)r! d Therefore, by 

6 (L- b)/b an 
+1 -bpf]. $- slate can be n 

( fi] É. point. I want 
Since r! exce 

(ofis 1 = ri — fi — fo, fo) is a rest pointi. Would follow 






and only if (A205 (A22) equal zero. If ni From (A2C 
1 then 7; = fı = fo = 0. Since (A20)-(AMe r| < 1 — [b 
equal zero, (0, 0, 1, 0) is a rest point. Givi left-hand side 
Aro = —ro[(2b — 1)(ri + fi) + bfo] ths AE 
Ari, < 0 when ri < 1. Hence, if (r;, fi, tk: To= 
Jo) is a rest point and rọ < 1 then rọ = 0. § b(Ï 
Since ro = 0 at all other rest points, a %Ẹ 
point is defined by (r, fı, fo) such that Ari 
0, Af; = 0, Aft = 0, and r; + fi + fo= 
Substitute 1 — r, — fı for fo in ee (aa 
— fi) is a 
point if and only if (7,1) satisfies: 








(A23) nib- 1) — 1) — bf] =9F 


(A24) b-f,(1 — bri) — bri(2 — r) = 








= 0 then (A23) is satisfied while (A24) 
satisfied by fi = b. Hence, (0, b, 0, 1 — b) 
: f a rest point. Now consider 7; = 1. Since then 
is evaluate ERS 2 0, (A23)—(A24) are satisfied so that (1, 
4 4,0 0) is a rest point. The final case to con- 

sider are rest points for which r, € (0, 1). 
solving (A23) ~- (A24), one derives fi = 
(2D = 1)/b)? and n= 1- b)/b. The unique 
iW rest point for which r, € (0, 1) is then ((1 — 
Ib, (2b — 1)?/b?, 0, (1 — b)(2b — 1)/b?). 













SE: To prove that (0, O, 1, 0) is not an asymp- 
tic attractor, (12) is used to derive Aro = 
6((2b — 1)(ri + fi) + bfo]. Since Aro < 

wri, € (0, 1), it follows that lim,... ri = 


<i. 

To prove that ((1 — b)/b, (2b — 1)?/b?, 0, 

1 = b)(2b — 1)/b*) is not an asymptotic at- 
§Biractor, it will be shown that in any neighbor- 

— bri — bf hood of this rest point, there exists a set of 
WE joints such that the system does not converge 

“to this rest point. Consider an initial state such 

(Bihar! > (1 — 5)/b, ri = 0, and 





25) [(2b-1)/b](1-1ri) 


>f! >b(1 —r})?/(1 — ori). 
of ite 

i Note that the expressions on either side of fi 
equal (2b — 1)*/b? when rj = (1 — b)/b. 
‘Therefore, by choosing r | sufficiently close to 
(l - b)/b and having fi satisfy (A25), this 
state can be made arbitrarily close to the rest 
Point. I want to show that Ari > 0 Vt = 1. 
Since r| exceeds its value at the rest point, it 
uld follow that the system diverges. 

From (A20), one knows that Ar! > 0 iff 
<1 — [b/(2b — 1)]f}. Rearranging the 
-hand side of (A25), one finds that r! < 


















Fi) + bf ol Spit — [b/(2b — 1)]fi. Hence, Ari > 0. When 
nee, if (rı, fifo = 0, one knows that Af! > 0 iff fi > 
| then ro = 0.4M@(1 — ri)?/(1 — br!). Thus, by the right- 
rest points, 


) such that A : 


26) (rt, ft fS) 
E 


ri — fi) is 


—b(fi + 2fo) 
(1 —b)(2fi + fo) 
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hand-side inequality in (A25), it is concluded 
that Af! < 0. I next want to show: if Ar) > 
0 and Afi < 0 then Ar{*! > 0 and Afi"! 
< 0. If Ari > 0 then rj < 1 — [b/(2b — 
1)]fi so that ri = 1 — [b/(2b — 1)]fi— e 
for some £ > 0. Using (10), it follows that: 
rit! =] — [b/(2b -1)] fi — e[2(1 — b)+ 
bfi + e]; and therefore rit! < 1 — [b/(2b 
— 1)] fi. Since, by supposition, A fi < 0 then 
fit! < fi. It follows from rit! < 1 — [b/ 
(2b — 1)]fi that rit! < 1 — [b/(2b — . 
1)]f{*!. Itis concluded that Ar{*' > 0. Now 
I turn to show that Af{*' < 0. Since rh = 0 
and Afi < 0 then fi = [b(1 — riy -: 
br})] + e for some e > 0. From (11), f{*! 
= [b(1 — ri)?/(1 — bri)] + brie so that 
fit’ > b(1— riy (1 — bri). Since Ari > 
O then rit! > ri. Given that b(1 — r{)?/ 
(1 — bri) is decreasing in r{, it follows that 
fit! > bd — ri*')7/C1 — brit!) and thus 
Afi! <0. 

It has been shown that: (i) Ar! > 0 and 
Af! < 0; and (ii) if Ar; > 0 and Afi <0 
then Ar{*! > Oand Af{*! < 0. By induction, 
Ari > O and Afi < 0 Vt = 1. It is inferred 
from r! > (1 — b)/b and Ari > OVt= 1 
that lim,.. ri > (1 — b)/b. This establishes 
that ((1 — b)/b, (2b — 1)?/b?, 0, (1 — b)(2b — 
1)/b?) is not an asymptotic attractor. 

The final step is to show that (1, 0, 0, 0) 
and (0, b, 0, 1 — b) are asymptotic attractors. 
Substituting 1 — ri — fi — fo for ro in (10), 
(11), and (13) and defining II (fi, ro, fo) to 
be the matrix of first derivatives, it is straight- 
forward to derive (A26) below. 

Let A(ri, fi, fo) = max { M, ..., A} where 
Ay, «+ » Ay are the eigenvalues to I(ri, fi, 
fo). By Theorem 4.10 in Kelley and Peterson 
(1991), if Ari, fi, fo) < 1 then (ri, fi, fo) is 
an asymptotic attractor. Using (A26), one can 
show that A(1, 0, 0) = max{2(1 — b), b, 
1 — b} < Land A(0, b, 1 — b) = max{b(2 — 


~br} 0 
b(2-ri—2fi—-2fo0 2b(1-ri-fi—fo) | 
(1-b)(2r,+2fit+fo) G-b)d+rit+fi) 
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b), 0, 1 — b?} < 1. Therefore, (1, fi, To, 
fo) = (1, 0, 0, 0) and (7, fi, To, fo) = (0, b, 0, 
1 — b) are asymptotic attractors. 
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Incentives $: The trade-off between efficiency and income 
American W distribution plays a central role in analyzing tax 
986, 76(4),MBrolicy.' The modern framework for analyzing 

‘Aithis trade-off using nonlinear income taxes was 
s of Thumh@¥éreqted in James A. Mirrlees (1971). While 
omic Behavigithis formulation crystallized a presentation of 
er 1993, 22(IBihe income tax problem and derived some of 
{Æe properties of optimal income taxation, the 
E. “The Quiiimplications for policy have been somewhat 
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timal Income Taxation: An Example with a U-Shaped Pattern 
of Optimal Marginal Tax Rates 


By PETER A. DIAMOND * 


Using the Mirrlees optimal income tax model with quasi-linear preferences, the 
paper examines conditions for marginal tax rates to be rising at high income 
levels and declining in an interval containing the modal skill. It examines con- 
ditions for the marginal tax rate to be higher at a low skill level than at the high 
skill level with the same density—an argument only holding for skill levels above 
a cutoff where resources of a worker are marginally of the same value as re- 
sources of the government. Data on earnings rates are presented. (JEL H21) 


tion are of little or no relevance for policy. This 
paper argues that the case for nonconstant and 
high marginal tax rates in the Mirrlees model 
is considerably stronger than has been realized. 


' The technical contribution of this paper is very 
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modest, being primarily a rearrangement of 
terms in the standard first-order condition for 
optimal income taxation. This rearrangement 
leads to a different way of approaching the 
combinations of assumptions that will sign the 
change in marginal tax rates with income level. 
In addition, by concentrating on the case where 
there is a zero income derivative of labor sup- 
ply, the intuition behind the first-order condi- 
tion becomes clearer. 

Section I reviews some of the previous lit- 
erature. Section IJ presents the optimal income 
tax problem. Section II examines conditions 
for marginal tax rates to be rising at income 
levels above the modal skill level. Section IV 
examines the level of marginal tax rates on 
very high incomes. Section V examines con- 
ditions for the marginal tax rate to be declining 
and to be higher at a skill level below the mo- 
dal skill than at the skill level above the mode 
with the same density of skills. These argu- 
ments apply for skill levels above a cutoff 
level, where resources are of the same value 
in the hands of the government and in the 
hands of a worker with the cutoff skill level. 
Section VI considers another example. Section 
VII looks at data on earnings rates to suggest 
the relevance of alternative empirical assump- 
tions on the distribution of skills. Some closing 
remarks are in Section VII. 
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I. Review of the Literature 


Formal results about the optimal income tax 
are fairly limited. (For recent expositions, see 
Matti Tuomala, 1990 Ch. 6 or Gareth D. 
Myles, 1995 Ch. 5.) Assuming that labor sup- 

_ply can be continuously adjusted, there is no 
gain from having marginal tax rates above 100 
percent since no one will have such a tax at 
the margin. That is, the same outcome can be 
achieved with taxes no greater than 100 per- 
cent. It is usually assumed that preferences are 
such that consumption is an increasing func- 
tion of the wage. Then, earnings will be non- 
decreasing in skill. It then follows that the 
optimal tax structure has nonnegative marginal 
rates (Mirrlees, 1971) and positive rates in the 
interior of the income distribution (Jesus K. 
Seade, 1982). 

Assuming that there is a finite maximum to 
the skill distribution, the marginal tax rate 
should be zero at the income level of the top 
skill (Efraim Sadka, 1976; Seade, 1977). The 
argument for this result is quite intuitive. As- 
sume this were not the case, then, extending 
the tax function to higher incomes with a zero 
tax rate would lead the top earner to work 
more, raising social welfare without losing 
any tax revenue. However, this condition 
need not convey information about optimal 
taxes over any significant region of incomes — 
the optimal rates need not approach zero until 
very close to the top. This point has been 
made by the numerical calculations in 
Tuomala (1984). , 

At the bottom of the skill distribution, in the 
presence of optimal taxes, there may or may 
not be an atom of individuals doing no work. 
If everyone works, then the argument for a 
zero marginal tax rate carries over (Seade, 
1977). However, if there is an atom of non- 
workers, the optimal tax has a positive mar- 
ginal tax rate at the level where earnings begin 
(Udo Ebert, 1992). This latter case seems em- 
pirically more relevant. 

In addition to these analytical results, pre- 
sentation of the first-order condition for opti- 
mal taxes has generally been accompanied by 
observations on the factors leading to high or 
low rates, ceteris paribus. Considerable effort 
has gone into simulations, starting with that by 
Mirrlees. In his simulation Mirrlees assumed 


























tax schedules.’’ As seen in the survey 2 
Tuomala (1990), similar results followed w 


icantly inverse-U shaped pattern (e.g., Re 
Kanbur and Tuomala, 1994). As will be cle 
ified below, simulation results are sensitive 
both the utility function and the family of dg 
tributions of skills assumed, opening up 
possibility of different conclusions. 


If. Optimal Income Tax Problem 


The Mirrlees optimal income tax problemi; 
the maximization of the integral over the py. 
ulation of a concave function of individi =~: 
utilities, subject to an aggregate budget cng. 
straint and subject to the constraint that int} 
viduals optimize in their choice of la 
supply given the relationship between w 
and after-tax income. The only differenti bal 
across individuals-in the model is a differemg. é 
in skills, with an individual of skill n having: <I 
marginal product equal to n. The model isi ist 
one-period model with only labor income|— go 
is assumed that the’ government can obse 
income received but not hours worked or sky 
Denoting consumption of someone with sig 
n by x(n), labor (in percentage terms) & 
y(n), and the concave utility function by u% 
y), the social objective function can be s 
as 


(1) | G{u[x(n), y(n) ] } f(n) dn, o 


no 


where G(u) is an increasing and strictly “§ 
cave function of utility, with G independent% 
n, and the distribution of skills is writtet 
F(n), with density f (n). It is assumed that*$ 
distribution of skills is single-peaked, wit% 
mode at nm. The density is assumed to be 
itive and continuous between the bottom 
the top skill levels, n and n,. 
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e resource constraint on this maximi- 
is labor sy 


on can be stated in terms of output—that 
i onsumption be less than aggre- 


h sregate C 
t 0), and a} pee production-minus government_expen- 
E eres. E: 





Í " x(n) f(n) dn 


no 


em (e.g, Rae =Í ny(n)f(n)dn — E. 
are sensitive ' 
e family of. 
opening up 

sions. 


constraint can be stated alternatively in 
rms of taxes. Denoting taxes as a function of 
earnings as T[ny(n)], consumption equals the 
$ difference between earnings and taxes, x(n) = 

y(n) — T[ny(n)]. In this case, the govern- 
ent budget constraint is that taxes cover gov- 


Problem 


e tax proble 
ral over the 
n of indivi 
ate budget 
nstraint that 
choice of lak 
p between wa 
only differe 
fel is a differi 
of skill n having 
. The model i$ 
| labor incom 
ment can obsa 
ts worked or. 


That the resource constraint can be stated 
€quivalently in terms of government budget 
balance or in terms of aggregate supply and 


vernment observes earnings, not hours 
tked or skill. Thus the government is re- 
icted to setting taxes as a function only of 





meone with $ tarhings. The incentive compatibility con- 
antage terms), Rtraint is that the selected labor supply, y(n), 
‘function by MgMaximizes utility, given the tax function, 


Mlay(n) — T[ny(n)], y(n)}. The relevant 
of the tax function is just the part that is 
stiected by someone—taxes can be set arbi- 
ey high at earnings levels that no one 
;0ses with the optimal tax structure. Thus 
* Mcentive compatibility constraint.can-be 
ped in the familiar form that a worker with 
Bru! n does not prefer to imitate the. earnings 
Worker with a different skill level: 


for all n and n’. 
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That is, someone of skill n would have to work 
n'/n times as much as someone with skill n’ 
in order to have the same earnings level. 

This paper will concentrate on the special | 
case where there are no income effects on la- 
bor supply.” That is, it is assumed that utility 
is linear in consumption (referred to as quasi- 
linear): 


(5) u(x,y)=x+v(1 —-y) 
= ny — T(ny) +u( — y), 


where v is assumed to be strictly concave.’ 
This assumption seems appropriate at very 
high income levels, since people at the top of 
the income distribution are likely to leave large 
estates— with a linear utility of bequests, nei- 
ther consumption nor earnings vary with the 
exact level of estate. (The receipt of such be- 
quests is not part of the model.) In addition, 
this assumption removes a source of consid- 
erable complication in tax analysis. In the 
presence of distorting taxes, income effects 
imply that lump-sum taxes have efficiency ef- 
fects since they change distorted labor supply 
decisions. 

This problem has some complexity in the 
derivation of the first-order condition for an 
optimal tax function, but is familiar from a 
number of mechanism design problems.* The 
simplest way to proceed is to replace the in- 
centive compatibility conditions, (4), with the 
first-order condition for individual choice, 
which, from (5), can be written: 


(6) v'[1—y(n)] =n {1-T'[ay(n)]}, 


where T” is the marginal tax rate. For later use, 
it is convenient to note that for the quasi-linear 
utility function the elasticity of labor supply 


= wå t- 


? For a discussion of this case with a constant elasticity 
of labor supply, see Anthony B. Atkinson (1990). The 
complementary case where utility is linear in leisure has 
been studied; see Stefan Lollivier and Jean-Charles Rochet 
(1983) and John A. Weymark (1987). 

7I assume that G’[v(1)] is infinite, so that someone 
doing no. work is given positive consumption and the non- 
negativity constraint on consumption can be ignored. 

* For an exposition of the mechanism design problem, 
see Drew Fudenberg and Jean Tirole, 1991 Ch. 7. 
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evaluated at the chosen labor supply of a 
worker of skill n, e(n), satisfies 


e(n) = —v'[1 — y(n) ] 
+ {y(n)v'[1 — y(n) ]}. 


(7) 


n 


Since the wage equals the skill level, this is the 
elasticity with respect to the wage, evaluated 
at the labor supply level that is chosen by 
someone with skill n. 

More complicated than deriving the first-order 
condition is the problem of checking when the 
first-order condition does indeed characterize an 
optimum. The complication comes from the 
need to check that individual labor supplies sat- 
isfying the first-order conditions are globally op- 
timal choices, and not just the solution to a 
first-order condition. This problem arises since 
the budget set is not convex when marginal tax 
rates are declining over some income levels. For 
any particular economy, one can check whether 
individual labor supplies are optimal. This issue 
raises the possibility that with the optimal tax, 
the distribution of skills results in a distribution 
of incomes that either has bunching at some in- 
come level (an atom of workers choosing the 
same income level) or a gap in the distribution 
of incomes. (Bunching at zero income or a gap 
between zero and the lowest positive income are 
not issues for the interpretation of the optimal 
tax structure below.) I do not explore this issue 
for this particular class of preferences, but pro- 
ceed with analysis of the first-order condition; 


the analysis holds where the equilibrium distri-. 


bution of incomes has no bunching and no gap, 
since generically the equation is a necessary con- 
dition for the optimal tax where this is true. 

` The first-order condition for the optimal tax 
can be calculated by specializing the condition 
in Mirrlees (1971) for quasi-linear preferences 
or deriving it directly, as is done in the Ap- 
pendix. As usually written, the condition is: 


PO- v')f 


=[(v' — yo")/n] 


«|p (p - o") a | ; 


(8) 


„ ual choice, (6), to rewrite (8) as 






where p is the Lagrange multiplier on the gpf 
ernment’s budget constraint and the functiy,# 
v and G are evaluated at the appropriate lap% 
supplies and consumption levels. 

It is convenient to use the elasticity of }y 
supply and the marginal condition for indiyg$ 










§ without the furt 
-stferences. Th 


l l -E gening the cha: 

(9) TL =T") Piar, althous 
A E pf the conditio 

=[(e7' + 1)/n] E jooger depends 


l ely of labor 
; : javolving the s 
% f (p =) ar i. lpf]. q M paa 
‘p be-zero. Tht 
Multiplying and dividing (9) by (1 — FJa gigas are need 
turn the integral into an average term, (9) cep expressions. 
be rewritten as: De 






















IMI. incr: 


(10) T'/(1-T')=A(n)B(n)C(n) bt 
E +I turn now to 
where A(n) =e '(n) + 1; ; condition for | 
: presence of qu 
g ; eral, the variati 
B(n) = Í (p — G') dF/{ p[1 — F(n))} “aly with pares 
i í since taxes wi 
C(n) = [1 - F(n)]/ . j; Pligd and the e 
(n) =[1~ FV Haor suo 
l ; Making for sir 
The analysis below examines these three funy’ stant elasticity 


tions, A(n), B(n), and C(n), under altem tility of leisu 
tive assumptions on the functions e(n), fe: {l= (1 - y) 
and G(u). (ian 

The absence of income effects allows ani. 3° 
tuitive grasp of the factors that determine bY; 
optimal tax structure. Increasing the margt; 
tax rate affecting some skill level involves 
increase in the deadweight burden for peo 
at this skill level. Thus, the optimal magi 
tax rate at some income level depends on! 
elasticity of labor supply at that income 10% 
since this is important for marginal distortié 
Increasing the marginal tax rate also transi. <: 
income from all individuals with higher K- üde 
to the government, without changing the ég: ™ 
tortions of their labor supplies. The weights" 
these two elements depend on the ‘ratio 0 
dividuals with skills above this level to 
viduals with skills at this level and on the! 
of skill which links the tax on hours to the? 
on income. This intuition is displayed in 
tion (10), where the first-order conditios” 
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optimal income tax is written as a product 
three terms. n f 

same approach to signing the change in 

ax rates can be used with the as- 


The 





i nc 
s one the change in the marginal tax rate are 
al 


similar, though the economic interpretation 
the conditions is more complex. A(n) no 
r depends only on the compensated elas- 
ticity of labor supply, but also has a term 
involving the second derivative of the utility- 
consumption, which is no longer assumed 
to be zero. Thus different economic assump- 
ns are needed to sign the mathematical 
ressions. 


resence of quasi-linear preferences. In gen- 
eral, the variation in the elasticity of labor sup- 
ply with skill will depend on the tax function, 
since taxes will affect the level of labor sup- 
plied and the elasticity varies with the quantity 
éf labor supplied. One obvious exception, 


slant elasticity of labor supply. In this case the 
‘utility of leisure satisfies v(1 — y) = c{1 — 
{l= (1 — y)]*} = c(1 — y*) for some con- 


ects allows ase >- 

at determin LEMMA A: Įfv(1 — y) = c{1 - [1 - (1 — 
¥)I‘} = c(1 — y*), then A(n) is a constant. 
level involve 

wurden for pore 
optimal marg 
21 depends of 
hat income Ie 
rginal disto 
rate also tra 
with higher Si 
changing thes% 
ss. The wei 

on the ratio olf 
this level to P% 


With quasi-linear preferences, a uniform 

fer from the government to all workers 
š no effect on labor supply, and so no extra 
pact on the government budget. The welfare 
pact of such a transfer is the average of G’ 
er the entire population. Thus one can con- 
de that the Lagrangian on the government 
dget cónstrairit, p, is equal to the average of 


p= i G'(n)f(n) dn. 


displayed ind i 


rder conditio gg; B(no) is equal to zero. 


DIAMOND: OPTIMAL INCOME TAXATION 87° 


Given the incentive compatibility con- 
straint, utility must be nondecreasing in skill 
and increasing where earnings are positive, 
since a worker can always have the same con- 
sumption as a worker with lower skill while 
doing less work, provided the level of work is 
positive. That is, above the skills at which . 
there is no work, utility is increasing inn. With 
G a concave function, G’ is then decreasing in 
n. Since B(n) is the average of [p — G'] from 
the level n to the top of the skill distribution, 
B(n) is increasing in n.° 

Since p is equal to the average of G’ and G’ 
is nonincreasing, there is a critical value of n, 
denoted nç, at which G’ is equal to p: 


(12) G’[u(nc)] =p. 


If nc occurs at a level of skill where there is 
positive work, then nc is unique; otherwise nc 
is set equal to the highest skill at which there 
is no work. The level of nç is endogenous, 
varying with both the structure of the economy 
and the nature of the social welfare function. 
To simplify the statement of results, analysis 
is restricted to economies where this critical 
level is below the modal level of skill: 
(13) Nc < Mn. 
This seems like the more interesting case, as- 
suming that the mode of skills is near the me- 
dian and the government would like to 
redistribute toward a fraction of the labor force 
well below one-half. 

I note that [1 — F(n)]B(n) is increasing in 
n up to nc and then decreasing in n. These 
results are summarized as: 


LEMMA B: B(n) is increasing in n. {1 — 
F(n)|B(n) decreases inn forn > ne. 


I turn now to the shape of the distribution 
of skills. Given the assumption of a single- 


$ Formally, differentiating B(), the derivative has the 
same sign as the average of (p — G’) from n to n; minus 
the value [p — G’(n)]. Since G’ is decreasing in n, this 
difference is positive. 
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peaked density of skills, nC(n) is decreasing 
in n for n below the modal level, n,,. For values 
of n above the modal level, the shape of C(n) 
depends on the family of distributions as- 
sumed for skills. With a Pareto distribution 
“above the modal skill level, (i.e., the density 
is proportional to 1/n'** for a > 0), then C(n) 
is a constant above the’ modal skiil level. 


LEMMA C: Forn < npa, nC(n) is decreas- 
ing inn. Forn > nm, C(n) is constant if F(n) 
is the Pareto distribution above n,,. 


One can now put these lemmas together to 
identify sufficient conditions for marginal tax 
rates to be increasing with income for incomes 
above the modal level. Where all three of 
A(n), B(n), and C(n) are nondecreasing and 
at least one is increasing, then marginal tax 
rates are increasing. 


PROPOSITION 1: Marginal tax rates are in- 
creasing above the modal skill if, above this 
skill, the elasticity of labor supply is constant 
and the distribution of skills is Pareto. 


With the conditions in Proposition 1, A (n) and 
C(n) are constants, so that T'/(1 — T’) varies 
with n as B(n) varies with n. With B(n) in- 
creasing, so too is T’. The result carries over 
if the elasticity of labor supply falls with skill 
at the equilibrium labor supplies. Similarly, it 
is sufficient to have a distribution of skills such 
that [1 — F(n)]/[nf(n)] is increasing. More- 
over, the result of rising tax rates will hold for 
part of the skill distribution (above the mode) 
if the conditions are met for that part; one does 
not need conditions on the entire distribution. 


IV. Asymptotic Marginal Tax Rates 


With a known finite top to the distribution 
of skills, the optimal marginal tax rate is zero 
at the top of the income distribution. As noted 
in the review of the literature and is clear 
from the argument behind Proposition 1, this 


need not imply that rates approach zero until 


very close to the top. Thus it is natural to con- 
sider the case of an unbounded distribution of 
skills and to consider the behavior of the op- 
timal marginal tax rate as skills rise without 
limit. 











In addition to assumptions on the disty,F 
tion of skills and the elasticity of labor sup i 
the shape of the social welfare of indivi a 
utility, G(u), needs to be examined. One y 
sibility is that the marginal welfare weight 
consumption of those at the top tends to z 












as skill rises without limit®. For example, 92 
is the case in the example in Mirrlees (197 Pie. i oe 
where G = —exp(—bu)/b (b > 0) andy ' 86 
alog(x) + log(1 — y). Similarly, it is the caf S35. Asymptor 
in Martin Feldstein’s (1985) study of sog: iiis with param 
security, where G = u and u = log(x). If 

goes to zero as n rises without limit, then By 

goes to 1. Alternatively, one might assume ty% 

G” has a positive lower bound which is gf fo examine t 


proached as n rises without limit. For exampkẸ ‘of yery high e: 
Atkinson (1990) considers the case of fora, e, and £ 
‘‘charitable Conservative’’ position, wherety®: viel 
marginal welfare weight of consumption take}: Jabor supply; 
on two values—a high one for ‘‘poor’’ peop lying ability < 
and a low one for ‘‘nonpoor’’ people. I deng¥ education is a 
by g the ratio of the lower bound on G” tot} effect, compe 
Lagrangian on the government budget cop elasticities arı 
straint, which is equal to the average of G's compensated 
the entire population. Thus, B(n) convergest tse for illustr 
1 — gas skill rises without limit. £ T im seeking 
Assuming a constant elasticity of labor sẹ} laoking at th 
ply, e, and a Pareto distribution for skil "provides an a 
above the mode with coefficient a, so tW% Vey by John F 
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C(n) equals 1/a, (10) becomes . done for a rar 
; A range of g 
(14) T'I —T')= (e7! + 1)B(n)/a. Ẹ Fór the coeff 
; sing tax dat. 
Solving for T’ and taking the limit as n nis% M Poterba ( 
one has: y tüd 1.5 over 
& COmes of the 


PROPOSITION 2: Assuming a Pareto distig: Me calculati 
bution of skills above the modal skill andl: Possibility of 
constant elasticity of labor supply, as § ¢ He distributic 
rises without limit the optimal tax rate chy: dlues of th 


[fom (15)] 


verges to : that th 
side 1 e 

(15) T'=(e'+1)(1-8) ahe 
m] 


+ [a + (e™! +1)(1 — g)] 


é In this case, the tax rate tends to the rev! 
maximizing rate, since, in the limit, the only effect of 
on welfare is through the budget constraint. 
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67 38 82 
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5 Notes: 


To examine the implications for the taxation 


s the case i$ for a, e, and g. Identifying skill with the wage 
sition, wherelgecields an elasticity based on adjusting hours of 
nsumption tor supply; identifying skill with an under- 
Ai ing ability suggests a larger elasticity since 
’ people. I deh education is also variable. With a zero income 
und on G' tog ffect, compensated and ordinary labor supply 
ent budget ca@elasticities are the same. Presumably it is the 
average of G Feampensated elasticity that one would want to 
guio for illustrative purposes. Recognizing that 
wi am seeking an elasticity for high earners, 
ooking at the elasticity for prime-age males 
provides an approximation. Based on the sur- 
vey by John Pencavel (1986), calculations are 
Sone for a range of elasticities from 0.2 to 0.5. 
‘A range of g from 0 to 0.5 seems very wide. 


4 1)B(n) the coefficient of the Pareto distribution, 
„eting tax data Daniel R. Feenberg and James 
e limit as n i Poterba (1993) find a varying between 0.5 


ieni 1.5 over the years 1951—1990 for the in- 
{emes of the top 0.5 percent of the population. 
e calculations reported below suggest the 

Ssibility of a considerably higher value for 
ë distribution of skills, perhaps as large as 5. 
ggtalues of the asymptotic marginal tax rate 
irom (15)} are shown in Table 1. Thus I con- 





cS in the quasi-linear Mirrlees model with 
a ible empirical parameters. 


V. Decreasing Marginal Tax Rates 


çonsidering decreasing marginal tax 
th Consider only the levels of skills above 
rends to the OO e level at which G’ equals p. At skill 
E tis only effect of Soe above nc, it would be desirable to trans- 


constraint. a sala away from this skill level (if it 






faf very high eamers, values need to be selected - 
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TABLE |—ASYMPTOTIC MARGINAL TAX RATES 


60 31 75 50 23 


Asymptotic marginal tax rates, in percent, with a constant elasticity of labor supply, e, a Pareto distribution of 
ehitis with parameter a, and a ratio of social marginal utility with infinite income to average social marginal utility of g. 


could be done costlessly). It is now conve- 
nient to work with equation (9), which I 
rewrite 


(9) T'K1-T’) 
=[e' + ul J (p= G’) ar | 


+ [pnf(n)]. 


As noted in Lemma B, at skill levels above nc, 
the integral in (9) is decreasing with skill. Be- 
low the mode, the density is rising and so 1/ 
[nf (n)] is falling with skill. Thus with a con- 
stant or rising elasticity of labor supply, the 
marginal tax rate is declining with skill. This 
argument also goes through above the mode 
where nf (n) is rising with skill. This is sum- 
marized in: 


PROPOSITION 3: Above the critical skill 
level, nc, marginal tax rates are decreasing 
where the elasticity of labor supply is constant 
and the distribution of skills has nf (n) rising 
with skill. 


While one would expect nf(n) to be increas- 
ing in n just above the modal skill, empirically, 


` this seems unlikely’ at high skills, as is indi- 


cated in the data discussed below. 

One can also use (9) to compare tax rates 
at two income levels above neç, on either side 
of the modal skill and such that the density is 
equal at the two points. With G’ less than p at 
the lower of the two skill levels being com- 
pared, the marginal tax rate would be higher 
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at the lower income level with a constant or 
rising elasticity of labor supply. This is sum- 
marized in: 


PROPOSITION 4: Above the critical skill 
level, nc, marginal tax rates are higher at the 
lower of two skill levels. that-have .the. same 
density and the same the elasticity of labor 
supply at the chosen labor supplies. 


Combining results, one can see the pattern 
of tax rates when the density of skills is single- 
peaked (and such that the workers with the 
modal skill work and have G’ less than p in 
equilibrium). With a constant elasticity of la- 
bor supply and the Pareto distribution of skills 
where the density is falling, the pattern of mar- 
ginal tax rates is U-shaped above nc, with the 
minimum of marginal rates occurring at the 
modal skill. Moreover, marginal rates are 
higher at the lower income levels. Plausibly, 
the density of skills does not have a kink at the 
mode, but changes smoothly from rising to de- 
clining as a Pareto density. Then, with the con- 
ditions in Proposition 3 the minimum of the 
tax rate (over the range above nc) occurs 
above the modal skill. It is worth reiterating 
that the range with declining marginal rates 
need not begin at zero earned income. 


VI. Another Example 


The assumption of a constant elasticity of 
labor supply relates the optimal tax to a fa- 
miliar concept in the analysis of deadweight 
burdens. By moving the term ‘‘n’’ from C(n) 
to A(n), one finds another eae with sim- 
ilar conclusions. Consider the logarithmic 
case, u(1 — y) = log(1 — y). In this case, the 
elasticity of labor supply is equal to (1 — y)/ 
y. Thus ‘one has: 


LEMMA A’: Ifv(1 — y) = log(1 — y), then 
A(n) =n(i—T’). 


If, above the modal skill level, the distri- 
bution is the exponential distribution, then 
nC(n) is a constant. 


LEMMA C’: Forn < nm, nC(n) is decreas- 
ing inn. Forn > n,,,nC(n) is constant if F (n) 
is the exponential distribution above nn. 


























I can now put together Lemmas A’, B, ay 
C- f 


PROPOSITION 1’: Marginal tax rates an: 
increasing above the modal skill if, above ț; 
skill, the utility-of-leisure is logarithmic qj 
the distribution of skills is exponential. 


For Proposition 1’, it is noted that T'/(1. 
T’)? varies with n as B(n) does. With By 
increasing, so too is T’. As above, from t 
arguments that led to Proposition 1’, one caf’ 
see that the result carries over if, at the equp-. 
librium labor supplies, the elasticity of latg}: 
supply falls with skill more than in the stay 
condition. Similarly, it is sufficient to havf. 
distribution of skills such that [1 — F(n)} 
f(n) rises. Moreover, the result of rising tg 


rates will hold for part of the skill distribute S R 
(above the mode) if the conditions are metfy Srg 
that part; one does not need conditions ont 

entire distribution. FIGURE 


For the case just analyzed, the asymptos 4 
marginal rate is calculated. With a logarithm: be 


distribution of skills (above the mode) wi 
coefficient b, (10) becomes: 


bones 


(18) TUG 
(16) T'/Q -T'Y z B(nX 1- F)f & 

= B(n)/b. 

Solving for T’ and taking the limit, one bash Fro 


PROPOSITION 2’: Assuming an expont$ móde, 
tial distribution of skills above the modal WẸ. ~ + 
with parameter b and logarithmic uty k 


tax rate converges to 
(17) T’=1-[(b'? + 4b')"? — one 


where b' = b/(1 — g). 


b, of 5, 10, and 15 (see Figures 1 and 2),%4 
optimal marginal tax rate tends to 64, 73.8 
77 percent. For g equal to 0.5 and the SẸ 
values of 1/b, the optimal marginal tax” 
tends to 54, 64, and 70 percent. 
Similarly, one can examine conditions” 
declining marginal tax rates with the * 











rr e 





an exponel 


afithmic utility-of-leisure. In this case, (9) 
the mode) 


Becomes: 
4 


18) 


=|" -ona /wr. 


fom Lemma B, it can be concluded that the 
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ROPOSITION 3’: Between the critical skill 
fivel, Nc, and the mode, n,,, marginal tax rates 
@eere decreasing if the utility-of-leisure is 
megarithmic. 


A 


‘Similarly, from (18) one can conclude: 
ROPOSITION 4': Above the critical skill _ 
2 Re, Marginal tax rates are higher at the 


of two skill levels that have the same 
if the utility-of-leisure is logarithmic. 


VIL. Data on the Distribution of Skills 


{While a careful attempt to fit this model to 
ble data is beyond the scope of this pa- 
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ellen (1-Fyni 


relative wage 


Ficure 1. Ratios [1 — F(n)]/f(m) anp [1 — F(n)]/[nf(n)] CALCULATED FROM RELATIVE WAGES 


per, it does seem interesting to examine the 
distribution of wages. For this purpose, cal- 
culations have been done using the March 
1992 CPS. This survey asked individuals for 
annual earnings in 1991, as well as weeks 
worked and typical hours per week. From 
these numbers one can calculate an implied 
average wage.’ Using these wages, calcula- 
tions were made of the mean wage per cell; 
the number of observations per cell, adjusted 
by interval width in order to be proportional 
to the density; and the number of observations 
with higher wages. Approximately 17 percent 
of the sample report wages below $1 or no 
work and are omitted. In order to have reason- 
able cell sizes, the wage intervals are first 
$0.50, but are expanded above a wage of $26. 
As expected, a smoothing of the data would 
show a single-peaked distribution, as assumed 
in the analysis above. In Figure 1 is shown the 
ratios (1 — F)/f and (1 — F)/(nf), where n“ 
is measured as the wage relative to the mean 
wage. Because the series are very noisy, the 
graph is a centered three-cell moving average. 


7 No attempt was made to consider both earners in a 
two-eamer family or wages of single females. 
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N O 0 ND MMO tT tH Om HK A nNnonnr nh HaAaNnNnsaneamnm 
Ss+OQePen Hn OGgoaenmae nN OUR AOHBAA ANS 
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relative scale 


Ficure 2. Ratios [1 — F(n)}/f(n) anp [1 — F(n)]/[nf(n)] CALCULATED FROM RELATIVE SKILLS 


For readability, the lowest wages are dropped 
from the graph since the ratios are very large. 
The figure shows sharply falling values of 
(1 — F)/f through the range where the density 
is first rising and then roughly flat, that is, up 
to a wage of roughly $13, a little below the 
mean of $13.70. Beyond this point (1 — F)/f 
is roughly constant at a value around 15. This 
implies a downward trend in (1 — F)/(nf).A 
constant value of (1 — F)/fis consistent with 
an exponential distribution over this range of 
values. 

With a longer time horizon than one year, 
one would consider education to be an endog- 
enous variable somewhat responsive to tax in- 
centives. One might also be interested in the 
distribution of skills within a cohort. Thus a 
further calculation was done by regressing the 
log of the wage on education, age, and age 
squared, and plotting the exponentiated resid- 
uals. In Figure 2 are the same curves for this 
distribution as shown in Figure 1 for the dis- 
tribution of wages (except that a moving av- 
erage was not used). This distribution shows 
a fatter tail than the distribution of wages, with 
(1 — F)/f rising and (1 — F)/(nf ) roughly 
constant for the top 15 percent of the skill dis- 
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range of values. 


VII. Concluding Remarks 


tuitive grasp of the factors that determin 
optimal tax structure. Increasing the marg 
tax rate affecting some skill level involv 
increase in the deadweight burden for pe 
at this skill level. Thus, the optimal mar 
tax rate at some income level depends 0 
elasticity of labor supply at that income l 
since this is important for marginal distort 
Increasing the marginal tax rate also tra 
income from all individuals with higher ¥§ 
to the government, without changing the 
tortions of their labor supplies. The weight! 
these two elements depend on the ratio 0 
dividuals with skills above this level to 
viduals with skills at this level and on the 
of skill which links the tax on hours to tht] 
on income. This intuition is displayed i 
equations above, where the first-order c% 
tion for the optimal income tax is written *; 
a product of these three terms. 




































ting of the first-order condition 
: w the critical role of the assumed 
f distributions for the upper tail of 
aS opposed to just the value of the pa- 
_ With the Pareto distribution, (1 — 
) is a constant and the change in the 
tax rate reflects the rate of decline in 
inal utility of income as well as the 
"in labor supply elasticity with skill. 
the exponential distribution, (1 — F)/ 
declines at the rate 1/n. Thus either the 
icity of labor supply or the social marginal 
vof income needs to be falling suffi- 
j rapidly to have constant or rising mar- 
tax rates. In the simulations in Mirrlees 
1971), it was assumed that the distribution of 
Sills was lognormal, so that (1 — F)/(nf) 
ines at the rate 1/log(n). Presumably the 
vely constant marginal tax rate in the 
simulations would have had a different 
a with a different assumed family of dis- 
butions. Exploration of the shape of this dis- 
i stion is clearly important for the normative 
Aao for different degrees of income tax 
THEressivity. 
ie there is not a simple route between the 
Mirrlees model and policy implications for an- 
‘qual income taxes levied repeatedly on fami- 
3 and covering both capital and labor 
‘mes. The assumption of a zero income 
A@masticity of labor supply and the limited in- 
gmeemation on both the shape of the skill distri- 
i ñ and the pattern of elasticities of labor 
ply by skill level would limit inferences 
if there were a simple route. Nevertheless 
are some lessons from the analysis. The 
fall in (1 — F)/f as skills approach the 
$ of the skill distribution from below 
highly relevant, especially if one wants 
redistribute from people near the mode, 
afer than to them. This finding on the shape 
“0 optimal (negative) income tax seems rel- 
in thinking about the phaseout of the 
4 income tax credit, and, possibly, wel- 
orm. That is, labor supply depends on 
return to earnings, which, in turn, de- 


ILLS j 


ion the n both the income tax and the phaseout 
irs to UE me-tested benefits. The presence of 
slayed J Marginal tax rates in the region of phase- 


ibe 


hefits is not necessarily a basis for 
of the programs—the optimal pro- 
Well have such a shane hecanse of 


order g 
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the advantage of higher marginal rates over a 
shorter range of skills where the skill density 
is large and rising. In other words, a sizable 
implicit marginal tax rate where benefits are 
being phased out is consistent with the U- 
shaped pattern of marginal rates and may well 
be optimal. 

Second, this model confirms the implication 
of Mirrlees’ calculations that the optimality of 
a zero tax rate at the highest income level is 
not a finding that sheds much light on optimal 
taxes, especially in the absence of knowledge 
of exactly where the top is. That is, if one re- 
placed an unbounded distribution of skills by 
a bounded one with the same distribution up 
to some level and a concentration of skills at 
the highest levels, the result of rising marginal 
tax rates continues to hold until the concentra- 
tion at the top is reached. There is no need for 
tax rates to decline slowly toward zero as one 
approaches the absolute top of the skill 
distribution. 

Third, the sensitivity of the pattern of mar- 
ginal rates to the measure of skill seems rele- 
vant, although different formulations of 
*‘skill’’ will be associated with different esti- 
mates of the elasticities of labor supply as well 
as different estimates of the shape of the dis- 
tribution of skills. This analysis emphasizes 
the importance of the shape of the distribution 
of skills for optimal tax rates. 


APPENDIX: HEURISTIC DERIVATION OF THE 
OPTIMAL TAX FIRST-ORDER CONDITION 


Using just the first-order condition for la- 
bor supply for the quasi-linear utility func- 
tion as a constraint, the optimal tax problem 
is 


(A1) Max [6 {ul x(n),y(n)1) fdn, 


subject to: T x(n)f(n) dn 


no 


< f ” ny(n)f(n)dn— E; 


no 


vfi —vin\l =n(1— T'Inv(n)1). 
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With x(n) = ny(n) — T[ny(n)], and using 
the first-order condition for labor supply, the 
change in consumption with skill satisfies: 
(A2) x'(n) = y(n)(1 —- T’) 
+n(l—T')y'(n) 


. [y(n) + ny'(n)]v'/n. 


With the quasi-linear utility function, one can 
calculate the derivative of u with respect to n: 


(A3) u'(n)=x'(n)-—v'y'(n) 
= y(n)v'/n. 
Treating u(n) as a state variable and y(n) as 


a control variable, the optimal tax problem can 
be rewritten as 


(A4) Max r G[u(n) ] f(n) dn, 
subject to: 


f” (acm — 001 = yon) Fo an 


= T ny(n)f(n)dn — E; 
no 


u'(n) = y(n)v'[1 — y(n) ]/n. 
Forming a Hamiltonian for this expression, 
H = {G[u(n)] — p[u(n) 

— [1 — y(n)] — ny(n)]} 
x f(n) + h(n)y(n) 


x v'[1 — y(n) ]/n, 


(A5) 


where p and h(n) are multipliers. The deriv- - 


ative of h is equal to minus the partial deriv- 
ative of the Hamiltonian with respect to u: 


(A6) h'(n)= —{G'[u(n)] - p} f(n). 


























Maximizing the Hamiltonian with respech 


y(n), ag ) e: Ca 
“ae: Ce 
(AT) -p{n-v'll-y@) 150) BR 


= h(n) {v'[1-y(n)] 


—y(n)v"[1—y(n) } ng 

nes : $ ntive Ef 
Recognizing that A(n) is equal to zero, (Age n 

can be integrated from n to n, to have anaf 

pression for h(n). Substituting in (A7), o$ 

then has the first-order condition in the tey 


(8). 
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Endogenously Chosen Boards of Directors 
and Their Monitoring of the CEO 


By BENJAMIN E. HERMALIN AND MICHAEL S. WEISBACH* 


How can boards be chosen through a process partially controlled by the CEO, 
yet, in many instances, still be effective monitors of him? We offer an answer 
based on a model in which board effectiveness is a function of its independence. 
This, in turn, is a function of negotiations (implicit or explicit) between existing 
directors and the CEO over who will fill vacancies on the board. The CEO’s 
bargaining power over the board-selection process comes from his perceived 
ability relative to potential successors. Many empirical findings about board 
structure and performance arise as equilibrium phenomena of this model. (JEL 


D23, D73, G39, K22, L29) 


Corporations are not governed by the pro- 
cess that corporate law would seem to imply. 
Corporate law states that shareholders choose 
the board of directors, but, in practice, share- 
holders almost always vote for the slate pro- 
posed by management.' Moreover, this slate is 
approved by, if not chosen by, the very CEO 
these directors are supposed to monitor (see, 
e.g., Myles L. Mace, 1971; Jay W. Lorsch and 
Elizabeth MacIver, 1989; Ada Demb and 
F.-Friedrich Neubauer, 1992). The resulting 
governance system has been criticized as in- 


* Hermalin: Haas School of Business and Department 
of Economics, $545 Student Services Building No. 1900, 
University of California, Berkeley, CA 94720; Weisbach: 
Department of Finance, College of Business and Public 
Administration, McClelland Hall, Room 315R, University 
of Arizona, Tucson, AZ 85721. The authors thank James 
Brickley, Aaron Edlin, Robert Gibbons, Charles Hadlock, 
David Hirshleifer, Steven Kaplan, Vojislav Maksimovic, 
Canice Prendergast, Anil Shivdasani, Sunil Wahal, Jerry 
Weisbach, two anonymous referees, and seminar partici- 
pants at the University of Alberta, University of Arizona, 
Boston College, University of California-Berkeley, 
UCLA, University of California-San Diego, University of 
Chicago, Cornell University, Dartmouth College, Hong 
Kong University of Science and Technology, University 
of Hlinois, Indiana University, University of Maryland, 
University of Rochester, and the Winter Econometric So- 
ciety for their helpful comments and suggestions on earlier 
drafts. 

! Even when shareholders do challenge management’s 
slate of directors in a proxy fight, Harry and Linda 
DeAngelo (1989) find that they win a board seat only 
about one-third of the time. 
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effective in controlling management (see, ee 


Michael C. Jensen, 1993). 

Given these apparent shortcomings, it} 
easy to forget that the current system is, now 
theless, the market solution to an organiuf. 
tional design problem (albeit one that mustké 
solved under legal constraints—e.g., all 
must have boards with certain powers ).” Tht 
as George J. Stigler and Claire Friedlan 
(1983) argued, before any criticism of cumeéf.-: 
practice is taken too seriously, a thorough w: 
derstanding of the market forces that have 
to its existence seems necessary. This, in pa 
is what we propose to do here. 

The previous literature has focussed 
what boards do, without asking how they % 
to be the way they are. However, the answë: 
to these questions are invariably linked. 
instance, a board packed with the CEO’s® 
atives will be less effective than one made& 
of large shareholders. To understand corpo 
governance, the questions of director choi 
and director function must be answé™ 
simultaneously. ; 

To capture this simultaneity, we assume” 
the board and the CEO negotiate over both! 4 


? Mark Roe (1994) expands on the role of legd $, 
political constraints. These, however, are better suid 
explaining cross-country differences in corporate 
nance than the intracountry differences that are ouf 

















































rs wage and the identity of new directors. 
oest negotiations could be explicit. Alterna- 
"w. in keeping with the institutional litera- 
: s (see eg Mace; Lorsch and MacIver; 
$? Demb and Neubauer), these negotiations 
Ha id be implicit—the CEO could nominate 
E ew board members subject to a tacit under- 
sanding about the set from which they may 
chosen. Were the CEO to violate this un- 
ding, the board would refuse to ap- 
ve his nominees. The CEO’s bargaining 
power in these negotiations comes from his 
narceived ability relative to a replacement. 
f These negotiations determine the board’s 
level of independence. Independence is im- 
portant because a director’s willingness to 
pnitor the CEO increases with his or her in- 
dependence. Monitoring provides information 
ent (see, $8 Fabout the CEO used by the board in deciding 
Besar 19) f whether to retain or to replace him. In this 
’ t model, therefore, both the structure of the 
‘AEF board and its actions are endogenously 
‘derived. : 
f: To evaluate the model’s realism, we com- 
pare its predictions to existing empirical find- 
ame ings. Some of the model’s predictions are: 





È 


» ACEO who performs poorly is more likely 
-to be replaced than one who performs well. 
CEO turnover is more sensitive to 


independent. i 
‘3. The probability of independent directors 


poor firm performance. 
Board independence declines over the 


better predictors of management turnover 
"than stock-price performance. 


e predictions are consistent with existing 
ities of large corporations. 
“Tedictions | and 2 match the empirical ev- 
hee on CEO tumover: A number of papers 
‘Anne T. Coughlan and Ronald M. 
ee t, 1985; Jerold B. Warner et al., 1988; 
ferien and Kevin J. Murphy, 1990), find that 
turnover is negatively related to prior 
ance. In addition, Weisbach (1988) 
the sensitivity of CEO turnover to 
lance is greater for firms with a higher 


being added to the board rises following 
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proportion of outside directors, presumably 
because these firms’ boards are more indepen- 
dent of management than are boards domi- 
nated by inside directors.* 

While prediction 1 follows from other mod- 
els (e.g., David Hirshleifer and Anjan V. 
Thakor, 1994), prediction” 2“is more novel. 
The intuition behind it is that new CEOs re- 
quire more monitoring than old ones, since 
less is known about their ability. More inde- 
pendent boards have a greater tolerance for 
this added monitoring, so they can afford to 
be tougher with an incumbent CEO whose per- 
formance is marginal. , 

Prediction 3 is consistent with the pattern of 
director turnover found by Hermalin and 
Weisbach (1988). They also find that the pro- 
portion of outside directors on the board de- 
creases over the CEO’s career. This finding 
suggests that board independence declines 
over a CEO’s tenure, consistent with predic- 
tion 4. 

Prediction 3 follows because poor perfor- 
mance lowers the board’s assessment of the 
CEO’s ability, reducing his bargaining posi- 
tion and thus increasing the probability that the 
CEO will be forced to accept more indepen- 
dent directors. Similar logic explains predic- 
tion 4: If a CEO keeps his job, then retaining 
him must be worth more to the directors than 
replacing him. This means that this CEO is, to 
some extent, a rare commodity, which gives 
him bargaining power vis-a-vis the directors. 
He is, therefore, able to bargain for a board 
that is more favorable to him. 

Prediction 5 is consistent with both 
Weisbach (1988) and Murphy and Jerold L. 
Zimmerman (1993), who estimate equations 
predicting management changes using both 
stock returns and earnings and find that earn- 
ings do a better job. Intuitively, earnings are a 
function of current management only, but 
stock returns reflect both current management 
and the expectation ‘of future management 
changes. 


3 Outside directors are nonmanagement directors not 
otherwise affiliated with the firm and inside directors are 
management or directors with close ties to the firm (e.g... 
ex-CEOs of the firm). Some researchers also include a 
third category, ‘“‘grey’’ or “‘affiliated,’’ intermediate in 
presumed independence between outsiders and insiders. 
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In addition, the analysis suggests the follow- 
ing predictions, which have not yet been em- 
pirically tested. . 


1. There should be long-term persistence in cor- 
porate governance. In particular, changes that 
either strengthen or weaken board indepen- 
dence should be ‘permanent’? in. that they 
change the long-term bargaining strength of 
the board against management. 

2. The stock-price reaction to management 
changes should be negative if the manager 
is fired on the basis of private information 
and positive if the manager is fired on the 
basis of public information. 

3. A CEO’s salary should be insensitive to 
past performance at relatively low levels of 
past performance, but sensitive at relatively 
high levels of past performance. 


The first prediction is consistent with anec- 
dotal evidence of long-term patterns in cor- 
porate governance. When a firm has an 
extremely able CEO, he will be able to use his 
bargaining position to ensure a relatively weak 
board throughout his career. Consequently, his 
successor will inherit a relatively weak board. 
Thus, the model suggests that there will be 
long-term persistence in firms’ governance 
practices and long-term interfirm heterogene- 
ity in these practices as well. 

The second prediction potentially explains 
why empirical studies have found no consistent 
stock-price reaction to management changes 
(see Warner et al.): there have been no controls 
for whether the dismissals were due to private 
or public information. Our prediction follows 
because a change in management conveys in- 
formation about both the board and the CEO. 
If the board bases its firing decision on private 
information, then a firing reveals that a CEO 
who was previously seen as better than the ex- 
pected value of a replacement is not. Expecta- 
tions of new management are lower than the 
previous expectations of old management, so 
the stock price falls. In contrast, if the firing is 
based on public information; then nothing new 
is revealed about the CEO, but the firing con- 
veys good news about the board’s indepen- 
dence, so the stock price rises. 

The third prediction comes from the struc- 
ture of the bargaining game in our model. 



























When the CEO is either new or a medic 
performer, the lower bound on his wage hi, 
(a bound stemming from a limited-liability,, 
sumption). As performance increases, 4 
CEO’s bargaining position increases as wel 
allowing him to capture a fraction of the m 
in the form of a higher wage. Í 
Although we focus on explaining phenog§’ 
ena related to boards of directors, the mug 
we develop is fairly general. It extends thejg 
matching model of Boyan Jovanovic (1978: 
by allowing for endogenous monitoring deg}; 
sions. Among its features is a formalizationg?: 
the argument that new workers are more v}. 
able than older, better-known workers, cetg 
paribus, because the former have a greater mf 
tion value (see Proposition 2 below). -$ 
The paper has the following organizatigg 
The next section reviews some previous wai ; 


A: 


our model. We extend it in Section W, s 
section A, by allowing the board to prefer 
incumbent CEO to replacements. In Seeti 
Il, subsection B, we consider how vario 


tion IV considers £ a reinterpretation of 0 
model that eschews bargaining and ensure i 
that turnover is always optimal from the shang ` Unió 
holders’ perspective. Many of our resuli 
continue to hold under this alternatitg 
interpretation, although we lose the ability 
analyze management capture of the boang VEAC of co! 
Section V considers some policy prescripiss Weisb: 
that have been offered to correct the percel 
failings of boards (e.g., requirements that 6g 
rectors be paid in stock rather than cash). %% 
model predicts that many of these polici 
be ineffective. We conclude in Section 


ownership and control inherent in a div 
held corporation leads to a board of dit 
controlled by management. They ae 4 


those who select the proxy commi 










om, the election of directors 

a enn period will be made. 

Since the proxy committee is ap- 

Sin ted by the existing management, the 

: Tiet can virtually dictate their own 
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{snagement’s apparent control of the board 
ection process led Berle and Means, as well 
imore contemporary authors such as Jensen, 
~uestion whether directors can be effective 
itors. . 
counterargument is that directors’ con- 
for their reputations in the managerial or 
orial labor market causes them to be ef- 
tive-monitors (Eugene F. Fama, 1980; 
ma and Jensen, 1983). Steven N. Kaplan 
g organiza ndj David Reishus (1990) find evidence con- 
e previous ya -istent with this argument: directors of poorly 
n I introdi Fecrforming firms, who therefore may be per- 
ection M, $E écived to have done a poor job overseeing 
(Wi management, are less likely to become direc- 
A (ays at other firms. On the other hand, as Bengt 
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‘To resolve these conflicting arguments 
about board effectiveness, an empirical liter- 
ature assessing the board’s role has developed. 
i Unfortunately, measuring the day-to-day ef- 
of our resi fect of board independence on corporate prof- 
ts is made difficult by simultaneity problems, 
ty pi fe board independence itself changes be- 

atase of corporate performance (see Hermalin 
licy prescri d Weisbach, 1991, for further discussion ).* 

this literature has been more successful 
halyzing the relation between corporate 
decision-making in specific circumstances and 
me measure of the board’s independence, 


pe a 























Some papers that nonetheless attempt to estimate the 
to-day effect are Paul MacAvoy et al. (1983) and 
ry D. Baysinger and Henry N. Butler (1985). Also see 


ere ts A. Brickley and Christopher M. James (1987), 
T aa ae at Rosenstein and Jeffrey G. Wyatt (1990), Wiiliam 
Separa g and Michael T. Maloney (1992), Omesh Kini et 
rent in a diveg ), Anup Agrawal and Charles R. Knoeber 
board of dir ), Sanjai Bhagat and Bemard Black (1996), James 
They argued Wks th and Daniel N. Deli (1996), Kenneth A. 
Bh “ovich et al. (1996), David Yermack (1996), 

i = Cotter et al. (1997), Robert Gertner and Kaplan 

in the hands Si ey jovial Mayers et al. (1997), and April Klein 
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such as its composition in terms of insiders and 
outsiders or other characteristics correlated 
with independence (e.g., directors’ sharehold- 
ings). For instance, Brickley et al. (1994a) 
find that board composition affects their de- 
cision to adopt poison pills; John W. Byrd and 
Kent A. Hickman (1992) find that the greater~ 
the fraction of outside directors, the better the 
stock market’s reaction to their firm’s tender 
offers for other firms; and Weisbach finds that 
the sensitivity of CEO turnover to firm per- 
formance increases with the fraction of outside 
directors on the board. Anil Shivdasani (1993) 
suggests that outside directors who own a sub- 
stantial number of shares and who hold more 
corporate directorships (presumably measur- 
ing the value they place on their reputations ) 
are better at negotiating a favorable deal for 
shareholders who face a takeover bid. Lastly, 
Kevin Hallock (1997) finds that firms whose 
boards are interlocked (contain a CEO on 
whose board the firm’s CEO serves) tend to 
pay their CEOs more. He argues that inter- 
locked directors are less independent and, con- 
sequently, give the CEO a larger fraction of 
the rents than would other directors. 

While the empirical literature on boards is 
fairly well developed at this point, there has 
been little formal modeling of the board. Ex- 
isting papers have considered important is- 
‘sues about the board’s conduct, but have 
ignored the process by which boards get to be 
the way they are (see Hirshleifer and Thakor, 
1994; Vincent A. Warther, 1994; Brickley et 
al., 1994b; Thomas H. Noe and Michael 
Rebello, 1996). Our view is that such mod- 
eling is useful, but it ignores the fundamental 
question of how boards can, in many in- 
stances, be effective monitors of the CEO de- 
spite being chosen through a process partially 
controlled by him. 


ite The Model 


We model the board selection process as a 
bargaining game between the CEO and the 
board. We assume no active role for the share- 
holders (although some of them could be 
directors). Consistent with practice, share- 
holders simply ratify the slate put forward by 
the company. We discuss shareholder activity, 
particularly in crisis situations, later. 
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A. Timing 


The game has multiple stages with the fol- 
lowing timing. 


1. At the start of the game, the firm has a new 


CEO. The commonly held prior.distribution __. 


about his ability, a, is normal with mean 
zero and variance 1/7» (Tq is the precision 
of the distribution). We set the mean to 
zero for convenience, but without loss of 
generality. 

2. The first realization of earnings, x,, occurs. 
Earnings are distributed normally with a 
mean equal to the CEO’s true ability, a, and 
a variance equal to 1/r. 

3. Based on x,, the board updates its estimate 
of the CEO’s ability. The board may at this 
stage decide to fire the CEO and hire a re- 
placement. The prior distribution of any re- 
placement CEO’s ability is normal with 
mean zero and variance 1/ro. 

4. The CEO (either the incumbent or the re- 
placement) negotiates with the board over 
filling vacancies on the board and his wage, 
w.° If the bargaining is unsuccessful, the 
CEO is fired and a replacement is hired. If 
a replacement is hired at this stage, the 
board bargains with him about the filling of 
vacancies on the board. 

5. The board may then acquire a private sig- 
nal, y, about the CEO. The probability that 
the board acquires this signal depends on 
the intensity with which it monitors the 
CEO. The signal is distributed normally 
with a mean equal to the CEO’s ability, a, 

and a variance equal to 1/s.° 


5 We treat the creation of vacancies on the board as 
exogenous. This is fairly realistic, since many vacancies’ 
arise for presumably exogenous reasons such as death, ill- 
ness, or reaching the customary retirement age. We also 
do not explicitly consider how the number of vacancies 
might limit the ability of the board to adjust board com- 
position. This is with little loss of generality, since (i) only 
the proof of Proposition 6 depends on achieving an interior 
solution (i€; onè in which vacancy limits do not bind); 
and (ii) board size often changes (see Hermalin and 
Weisbach, 1988, for evidence). 

6 Alternatively, we could assume that the board always 
receives a signal, but its precision (i.e., s) is endogenous 
(we thank Canice Prendergast for this point). This alter- 
native leads to similar results, except the comparative stat- 
ics with respect to 7 are ambiguous. 
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6. If the board acquires the signal, it upds, 
its estimate of the CEO’s ability. a 
this posterior estimate, the board may 4§ 
cide to fire the CEO and hire a replacems ms 

7. The second realization of earnings (prh 
gross of the CEO’s compensation), x, ab 

, curs. Again, earnings are distributed w 
mally with a mean equal to the CEQ’s ms 
ability and a variance equal to 1/r. The 
dom variables y — a, x; — @, and x, ~ 
are independently distributed. 


B. Preferences and Ability 


The CEO in charge at stage 7 receives -SECO 
control benefit of b > 0. A CEO who is ġ¢ ton 


missed prior to this stage (or not si a 


ceives a benefit of 0. 


vives to stage 7. A critical assumption is Mi H 
the CEO is protected by limited: liabiliyg 
specifically, the wage must be nonnegiatik 
(i.e., w = 0). 

A 'CEO’s ability is fixed throughout bisak: x 
reer. We follow Holmstrom (1983) by assu 
ing that the CEO, like the board, knows of. 
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the distribution of his ability (i.e., that itii ae of the 


normal distribution with mean zero and pty 


cision To). We justify this assumption by mg “cin 


ing that the uncertainty about a CEO’s abili 
in a particular job is largely uncertainty aW a y 
the match between him and the firm, whichi¥ 
similarly unknown to both the board and Be 
CEO. 
We assume that each director, i’s, uty as 


(1) bx — K;d(p). 


The constant 6; > 0 equals the director's 1 
ginal utility from firm profits, x2.” We ima? oe 
that directors put different weights on p E g 
for two reasons, First, directors’ incomes?% 
pend on their own shareholdings. at | 


tors act, so we need not explicitly account for theif # if af 
on directors’ utilities. . 
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ncems for building reputations as 
> agers may vary. 
The bee is the probability that the 
‘obtains an additional signal, y, about 
CEO. It reflects the intensity with which 
board monitors the CEO. The disutility 
snitoring is Ki d(p), where d(-:) is a 
on, strictly increasing, strictly convex, 
twice-differentiable function and K; 1S 
director's distaste for monitoring. We 
e that directors’ distastes for monitor- 
vary for three reasons. First, inside di- 
* careers are tied to the CEO’s, so they 
ly find it in their interest to monitor him. 
ond, the opportunity cost of the direc- 
“5° time will vary among outside directors. 
br not hired{ilmginally, directors who value the opportunity 
(Mio serve on other boards could have an in- 
ensated witgmeentive to establish reputations for not 
bargaining rocking the boat’’; i.e., for not intensely 
board. This‘vjmmanitoring the CEO. 
her the CEO Since utility functions are defined up to an 
ssumption ist ¥fline transformation only, we can replace (1) 
limited liabi 
be nonnegif 
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xı — k;d(p), 

Where kı = x,/6,. We interpret k; as a measure 
mi director i’s lack of independence, at least in 
ms of the way he or she behaves. 

iWe assume k; is fixed for a given director i 
| Specific firm and for a specific set of pe- 
ary incentives). In particular, it is invari- 
‘with respect to who the CEO is. We relax 
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n nature of our results. In addition, we 


iss below the implications of basing in- 
ual director monitoring costs on the 
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where z is either x, or y and ¢ is either r or s 
(see Morris H. DeGroot, 1970 p. 167). The 
posterior distribution is also normal. 

From (3), the board has a more precise es- 
timate of an incumbent CEO’s ability at stage 
3 than it would.of any replacement CEO it 
hires. That is, 


(4) T >To, 


where 7 is the precision of its estimate of the 
incumbent CEO’s ability. Intuitively, an in- 
cumbent is a ‘‘known entity,” so there is less 
uncertainty about him than there would be 
about a new CEO. 

The distribution of the signal y given the 
CEO’s true ability, a, is normal with mean 
qa and variance 1/s; hence, the distribution 
of y given the CEO’s estimated ability, â, is 


normal with mean â and variance i/s 
+ 1/7. Define 


ST 
SET 





to be the precision of y given â? 

Observe that the board’s posterior estimate 
of a CEOQ’s ability is also the expected value 
of xı. After fixing (sinking) the CEO’s wage, 
it is also the expected value of profits. 

The alternative to retaining a CEO is to hire 
a replacement. The expected earnings from a 
replacement are, by assumption, zero. More- 
over, because all replacements are ex ante 
identical, they have no bargaining power. 
Hence, the directors can set a minimum wage, 
w = 0. The expected profit from a replacement 
CEO is, therefore, zero. Subsequent to obtain- 
ing a signal, y, the incumbent CEO will thus 


8 The random variable y — â is the sum of two inde- 
pendently distributed normal variables: y — æ and a — 
â; hence y — â is also normally distributed. Since the 


means of these two random variabies are both zero, the - 


mean of y given @ is, therefore, &. The variance of the two 
variables are 1/s and 1/7 respectively, so the variance of 
y — & and, thus, y given @ is 1/s + 1/7. 

? As a convention, we will denote functions of many 
variables, such as H, by capital letters. When we need to 
be explicit about an argument of such functions—for ex- 
ample, the function F evaluated at x = x'—we will write 


rare 


he 





bat 


we 
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be dismissed if @’ < 0. Using (3), we can 
restate the dismissal condition as 


(5) = Sy: 


The firm’s expected value if it will learn y 
is 


-e * ~. 


Ve i maxi 0 isa H -umoa dy. 
= THs Qn 


Since the option to fire the CEO is a valuable 
option, it follows that V > @ for all r. 

Straightforward calculations reveal that V 
can be written as 





(6) V=a6(-(¥, — 4)VH) 


+ HH sca, - @)VH), 


where Y, is the left-hand side of (5), ®(-) is 
the distribution function of a standard normal 
random variable (i.e., with mean zero and 
variance one), and $(-) is its corresponding 
density function. Note that 


&(—(¥, — &)VH) 


is also the probability that the CEO will be 
retained if evaluated. 

A higher-ability CEO is always better, but 
the value of the option to fire him is decreasing 
in a: 


LEMMA 1: V is increasing in &, while V — 
â is decreasing in â. 


That is, the value of additional information 
about the CEO’s ability is smaller the greater 
is the prior estimate of his ability. — 

Consider, now, the issue of how the_board 
decides on the intensity (probability, p) with 


10 All proofs are in the Appendix. 





















board chooses p to maximize: 


(7) max pV + (1 —p)max (0, â} — kd Seiessption 
p € [0.1] p ; 


where k reflects, in some way, the colle 
preferences of the board (i.e., Ok/Ok; = 
all i and strictly positive for at least one i) 
instance, k could be the average of the ach 
More consistent, perhaps, with theories of yf aes 
ing, k could be the median k;. Note that x ind: 
resulting p will be Pareto optimal from the pg’ 
spective of the board members. ‘ 
The first-order condition for (7) is 


(8) V- max{0, â} — kd'(p) =0. 


Expression (7) is concave in p, so (8) iss hen t 
ficient as well as necessary. Define P* to} a 
the solution to (8). To keep the analyse 
straightforward, we consider only interiors 
lutions (i.e., P* € (0, 1)). Corner solutio 
are a relatively simple extension. Properties 
P* are: 4 


PROPOSITION 1: The intensity with whidp: 
the board monitors the CEO, P*, is 


(i) decreasing with its prior estimate ofi 
ability, &, if a = 0; 

(ii) decreasing with the precision of its pnt 

estimate, T; 

(iii) decreasing with its collective lack of ite 

dependence, k, but 

(iv) increasing with the precision of the 4%: 

nal (i.e. 5). 


Intuitively, the more costly monitoring 
to the board’s members (or the less wel 
they place on the firm’s profits), the gre 
is the marginal cost of monitoring, so & 
engage in less of it. The more able the bot% 
believes the CEO to be, the less valuable err 
the option to fire the CEO, so the board m$ 
itors less. The option to fire the CEO is* 
ilarly less valuable the less uncertainty thee 
is in its prior estimate, so the board monte: ; 
less intensely when the CEO’s abili Ecg 
known more precisely. However, the op” 4 Sats, 
to fire the CEO is more valuable the gf 
is the precision of the signal, so the 














ors more intensely when the signal is 
. tive. 
m 1 is consistent with the general 
sntion that less-independent boards do 
-monitoring and that long-established 
(ie, CEOs with high values of 7) re- 
less scrutiny. Being monitored increases 
fikelihood of being dismissed, so Propo- 
za 1 is also consistent with the evidence in 
isbach, which suggests that outsider- 
nated boards (which are presumably 


, the colle 
Jkl ðk, =i 
east one i)'¥ 
Age of the 
theories ofa 






















. Note thi @i.oce independent) are more likely to fire 
hal from the f; poorly performing CEO than insider- 
ie MER minated (less-independent) boards. 

(7) is ae 


D. Negotiations Between the CEO 
and the Board 
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{iq surplus to the bargaining table, where R is 
ia share of b the board can expéct to capture 
öm a replacement CEO. However, given the 
@piimited-liability assumption (w = 0), the 
sgmeooard cannot capture any share of b from a 
spafeplacement CEO. Consequently, R = 0. The 
or estimate ofgmtsctmbent CEO brings his expected benefit, 
cision of its pB.. 


-1 R d 
ective lack 


cision of the, : H the table. So their joint surplus is 
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‘ever, the oP- Soang (10) to (8), we see that the mar- 
table the gy FHN of monitoring is lower in (10) by 
al, so the DEE )b, which means that the level of mon- 


+ (p®(—(Y.- a)VH) +1-p)b. 


Maximizing (9) with res i 
à pect to p yields the 
st-Order condition 


V ~ max {0, â} — kd'(p) 
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itoring that maximizes joint surplus (9) is 
lower than the level of monitoring that maxi- 
mizes the board’s expected utility (7). That is, 
if P** is the solution to (10), then P** < P*. 
_ It is worth considering why P** < P*. Part 
of the surplus that can be shared by the incum- 
bent CEO and the board is the incumbent’s 
chance of getting the control benefit, b. If he 
is fired, then this chance is lost; it goes to the 
replacement CEO. Moreover, limited liability 
prevents the board from recapturing it by set- 
ting a negative wage. Consequently, the mar- 
ginal joint benefit of monitoring is reduced." 
We assume that the board and the CEO can- 
not contract directly on the probability that the 
board will evaluate the CEO (i.e., p). This as- 
sumption is consistent with the general per- 
ception that it is difficult for outside parties to 
verify how diligent the board is in its moni- 
toring function (if it were easy for outside 
parties to verify the board’s diligence, presum- 
ably the board would contract with the share- 
holders on this issue). This, however, creates 
a problem, because the board’s private incen- 
tive is to choose a level of monitoring greater 
than that which maximizes joint surplus; i.e., 
the board would choose P* instead of P**. 
The only way for the CEO and board to 
avoid this problem of too much monitoring is 
to lower the board’s incentives to monitor by 
raising k. We interpret the negotiations over 
k as decisions over factors likely to affect the 
independence of the board, such as board com- 
position (e.g., proportion of insiders versus 
outsiders), board compensation, and so forth. 
We assume that when the board negotiates 
with the CEO over k and w it cares about its 
utility only; that is, it does not consider 
the new (future) directors’ utility in their 
negotiation. 
Let ky denote the collective lack of inde- 
pendence of the continuing directors.'* If 


*"In many ways, ‘the situation is similar to Philippe 
Aghion and Patrick Bolton’s (1987) exclusive-dealing 
model. There, a retailer (our board) and a monopoly pro- 
ducer (our incumbent CEO) enter into an exclusive- 
dealing contract because of their concern that an entrant 
(our replacement CEO) will capture future surplus. 

'2Note the flexibility to change board composition 
comes from filling exogenous vacancies or adding direc- 
tors to the board—no continuing director need leave to 
realize a change in board composition. 
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bargaining with the CEO yields a new board 
with a different lack-of-independence param- 
eter, k,, then the continuing directors’ ex- 
pected utility is 

Př- V+ (d a}: 


(11) — Pf_z,)max {0, 


qa 


— kyd(P#27,) — w. 


Observe that the equilibrium probability of ob- 
taining a signal, P*, is a function of the new 
board’s lack of independence, k,, not the con- 
tinuing directors’ lack of independence, ko. 

We model the negotiations between the 
CEO and the board as a Nash bargaining 
game: the CEO and board agree to the level of 
independence (i.e., k) and wage that maximize 
the product of their’surpluses from trade. Pro- 
vided the limited-liability constraint does not 
bind, the resulting composition (i.e., k) will 
also maximize their joint surplus.'* Assuming 
that the CEO has bargaining power is consis- 
tent with the institutional literature on boards, 
which suggests that CEOs, both in the United 
States and abroad, have considerable say over 
who is nominated for board positions. It is also 
consistent with the view that a CEO who has 
proven himself to be more valuable (in expec- 
tation) than any potential replacement should 
have some degree of bargaining power. 

The surplus of the players is the difference 
between what they expect to receive if an 
agreement is reached and what they expect to 
receive if no agreement is reached. If no agree- 
ment is reached, the CEO leaves the firm—in 
which case his utility is 0. The CEO’s surplus 
is, therefore, 


P*O(—(Y, — &)VH)b + (1 — P*)b + w. 


If no agreement is reached, the board hires 
a replacement CEO. Let Up be the board’s ex- 
pected utility if it hires a replacement (we will 


83 Other bargaining games would yield qualitatively 
similar results provided the CEO’s bargaining power in- 
creases with his perceived ability. 










derive its value shortly—see Lemma 2 rE 
low). The board’s surplus is, thus, '* 


P*V + (1 — P*)& — kod(P*) — w~ u, ae 


Under Nash bargaining, the board and 
CEO choose k and w to maximize 


a An ~ 


(12) (P*V(4,7)+(1 — P*)é Propositic 
aluate a ¢ 


— kod(P*) — w — Uo) 
x (P*5(-(Y. — &)VH)b 


+(1- P*)b +w). 


To maximize (12), we need to know 
value of Up. 


LEMMA‘ 2: Up = PoVo — kod (Po), where Wd. 
is V evaluated for anew CEO—i.e., 


VH 


Vo = a (0) poe 
° 4 ‘PROPOSIT 
A ‘abllity for tl 

















—and P, solves the equation 
Vo = kod! (pie 


that is, Po is the E board’s utilin§ 
maximizing level of monitoring of a new CE 
Moreover, the wage paid a replacement CE. 
is zero. 


Intuitively, new CEOs have no bargainifi 
power, since they all have equal expect 
value. Consequently, the board can set a më 
imum wage and get its most preferred leveld 
independence, which is to replicate its cup, 
level (i.e., ko). ` 
Recall our assumption that the board d 
choose to fire the CEO prior to bargaining’ §, 
might, at first, seem that the board would & k 
the incumbent CEO if. and only if his estimi” g 
ability were rs than ihe esninated ability 


1 The reader will note that we have replaced nal a 
0} with ĉ. As we will show in Proposition 2, any ind 
bent CEO who is not fired prior to bargaining mut 


an estimated ability greater than zero. 
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<b ement (i.e, if and only if ê < 0). This 
He For, however, true: 

fey SITION 2: A unique finite cutoff, 
apne such that an incumbent CEO is 


rior to bargaining if and only if his es- 


cs 
3 





imize B imated ability is less than A,. 
P*)a Fe < proposition 2 follows because the right to 
evaluate a CEO creates a valuable option. 
- Up) Mihe value of this option increases with the 
rior uncertainty about the CEO (i.e., 1/7). 
&)VH)b Consequently, its value is greater for a new 
‘CEO than for an incumbent CEO. A new 
+w). CEO is, therefore, more desirable than an in- 
bent CEO ceteris paribus. So, an incum- 
eed to know@ nt’s estimated ability must be strictly 
i greater than a new CEO’s if he is to retain 
is job. 
d (Po), wher te A natural question to ask is whether less- 
Q—i.e. independent boards tolerate worse perfor- 
WS mance than do more-independent boards; that 
Fis, do greater values of ko lead to lower values 
RE of A.. The answer is yes: 
ROPOSITION 3: The minimum estimated 
1 ability for the incumbent CEO at which he will 
l kbe retained, A,, falls as the board becomes less 
: 0; A@pindependent; that is, 
board’s utili p dA. zi 
1g of a new UAE Oko 


eplacement (EE 


Remark I: Since ĉ is a decreasing function of 
Performance [recall (3)], Proposition 3 im- 


plies that CEO dismissals are more sensitive 
(© (negative) firm performance when the 
oreferred leva! fii 18 more independent. As such, Proposi- 
olicate its cum nc” 3 is consistent with the evidence in 
Ag eisbach. 


Ri 
e no bargaili 
equal expe 
rd can set a m 


ae Proposition 3 ‘holds becausé a replacement 
729 Must be monitored more than an incum- 
CEO. The less independent the board 
» the greater the board’s distaste for mon- 
foring, Ko), the greater is the cost to such a 
m hiring a new CEO because he re- 
“more monitoring. Hence, such a board 
willing to tolerate a mediocre CEO 
4 more-independent board (i.e., one 


board would? 
y if his esti 
imated abili g$ 


i 
ive replaced m 
osition 2, any #4 
argaining músa 
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Consider, now, bargaining between the 
board and an incumbent CEO who will be re- 
tained (i.e., one for whom â_> A,). Max- 
imizing (12) with respect to k and w yields 
first-order conditions that are equivalent to 


(13) (V-â -kod'(p)) 
x (pbb + (1 —p)b +w) 
+ (p(V-âĉ)+å 
— kod(p) — w — Uo) 
x (® — 1)b = 0 and 
(14) p(V-â)+åâ-kd(p)-w-— Up 


— (pðb+(1—p)b+w)=0. 


Since & > A,, the board’s expected utility ex- 
ceeds Up, so the second line of (13) is nega- 
tive. The first line must, therefore, be positive, 
which implies 

(15) V—&-—kpd'(p) > 0. 

From the first-order condition for the board’s 
optimal p (8), condition (15) implies that p < 
P#.;,, which, from Proposition 1, implies k > 
ky. We have, thus, established: 


PROPOSITION 4: If the continuing direc- 
tors choose to retain the CEO, then the new 
board will have less independence than did the 
continuing directors (i.e., k > ko). 


We emphasize the word ‘‘continuing’’ be- 
cause the new board is less independent only 
relative to those directors who continue to 
serve. The proposition does not compare the 


~ new board with the previous board (i.e., the- 


continuing and departed directors ) .'° If, on the 


15 To illustrate this point, consider a hypothetical board 
with ten directors: five outsiders and five insiders. Suppose 
that k; = kon for the outsiders and k; = &,, for the insiders, 
Kou < Kin- Suppose, too, that k equals the board’s average 
k. This board’s average k is Vokin + 'hkon. Suppose two 
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other hand, ‘‘normal’’ attrition from the board 
leads to an average level of independence 
among the continuing directors that approxi- 
mates the level of independence of the original 
board, then Proposition 4 suggests an expla- 
nation for the finding that boards become less 
independent over the career of the CEO (see 
-Hermalin and Weisbach, 1988). 

Proposition 4 suggests that corporate gov- 
ernance is subject to a stochastic form of ‘‘en- 
tropy’’: in the long run, boards will be 
relatively ineffective, consistent with the com- 
mon complaint leveled against them. Propo- 
sition 4 is, however, subject to two caveats that 
potentially affect its interpretation. First, a key 
assumption behind this result is that the mon- 


itoring burden is shared equally by the direc- . 


tors (i.e., each must expend p). If the 
monitoring burden could be shared unequally — 
if for instance monitoring was a team pro- 
duction problem such as considered by 
Holmstrom (1982) —then this entropy result 
need not hold. Directors would have an incen- 
tive to free ride on the diligence of other di- 
rectors. This in turn would give them an 
incentive to want new directors with strong 
proclivities for monitoring (i.e., low k;’s) 
rather than, as here, the same proclivity they 
have. In a richer model of board activity, en- 
tropy would, then, depend on the degree to 
which monitoring is a collective activity (as 
here) versus a private activity (as in a teams 
problem). 

Second, we have assumed away any role for 
the shareholders, in keeping with the institu- 
tional evidence that they rarely play a direct 
role in either the ‘‘normal’’ selection of direc- 
tors or the day-to-day operations of the com- 
pany (see Mace, for example). Proposition 4 
serves to emphasize the importance of those 
occasions when shareholders ‘‘break’’ the en- 
tropy through hostile takeovers, proxy fights, 
or direct negotiations between large share- 
holders and management. 


pa a a S a a a 
directors is ?/skin + “/ekou. Suppose, consistent with Prop- 
osition 4, bargaining results in the addition of one insider 
and one outsider. The new board’s average k is ?/ska + 
%/skou. The new board is, therefore, less independent than 
the continuing directors, but is more independent than the 
original board. 







Whether the bargaining maximizes 3 
board and incumbent CEO’s joint surplys af 
pends on whether the limited- -liability og! 
straint binds. If it does not bind, then (ay 
an equality. Using it, (13) becomes 


V2 E 





: e shay ont > eee arn could still 
which is the first-order condition for maxing mance is | 
ing joint surplus (9). If the limited-liabiiyk position 6, 


constraint does bind, then (13) is equivalen er estimat 













V — â — kod' (p) — (1 — ©) Gb =0, 


where ¢ < 1. Consequently, the solution gt 
the problem in which the limited- ~liabiliy 
constraint is binding involves more monity# 
ing and, hence, greater board independent: #7 
than if the constraint is not binding. Thi: i é 
establishes: 


PROPOSITION 5: Suppose that the incun : 
bent CEO is retained. If the limited-liabilst “liyi in [A.. 

constraint is not binding, then the level of *@lso exists an 
monitoring will maximize the CEO alk T 
board’s joint surplus. If it is binding, then tiẸ 
level of monitoring will exceed the Join 






board independence will be greater if the cory: 4 ie and est 
straint is binding than if it is not binding. Ẹ is initially fi 
2 iive, it equ 














affects the ultimate equilibrium level of sctf'..the table anı 
tiny (the probability of being evaluated) thy: ' 
the CEO will face. i 


PROPOSITION 6: The equilibrium probes: ; 
bility that the future board evaluates ant. 
cumbent CEO who is retained is decreas i 
with the prior estimate of his ability. 


Remark 2: Given the monotonic relationshf 
between monitoring and board independe% 
and between first-period earnings performs pensation i 
and estimated ability, Proposition 6 imp 
that performance and the independence 0 

ditions to the board should be negatively | 

related, which is consistent with Hermalit® 
Weisbach’s (1988) findings. if 





Propositions 4 and 6 show that history sf 
ters in corporate governance; that is, we. 












































^ p some hysteresis. Strong, independent 
p will beget stronger, more-independent 
r jpoards than will weak boards. Consequently, 
follow two firms, A and B, over time, 
+ there is a good probability that we will 
A always has a stronger board than B if 
initially has the stronger board. Moreover, 
this could still be true even if B’s recent per- 
formance is better than A’s; indeed, from 
sition 6, better performance by B (i.e., a 
oher estimate of ĉ) could actually accentuate 
the difference between the two boards’ relative 
ngth. These results underscore the impor- 
ce of considering endogeneity in empirical 
studies of corporate governance and they po- 
wntially explain the inconclusive results of 
MacAvoy et al. and others. 
Finally, we consider the relationship be- 
E“ ween the wage, w, and estimated ability. 


PROPOSITION 7: There exists an A, A, < A, 
E such that an incumbent CEO with estimated 
Wg ability in [A., A] is paid a wage of zero. There 
Mee also exists an A, A < A < %, such that an in- 
anges cxmbent CEO’s wage is increasing in his esti- 
U} mated ability for estimated abilities in [A, œ). 


For a retained CEO, the relation between his 
Wage and estimated ability (past performance ) 
Fis initially flat (zero). When the wage is pos- 


19 


itive, it equals one-half times the difference 























estimated abil p between the surplus that the board brings to 
im level of sM the table and the surplus that the CEO brings. 
5 evaluated) W: Whether the wage-is increasing in estimated 


bility depends on whose surplus is increasing 
ter in â. Since the board’s surplus is 
bounded but the CEO’s is bounded, it must 
be that the wage is eventually increasing in 

% <stimated ability. What we have not been able 
We} ' establish is how the wage varies with esti- 
mated ability between A and A.'® Overall, 
Oposition 7 predicts that the level at which 
noncontingent portion of a CEO’s com- 


ed is decre 
ability. 


mic relationshi 
rd independeigy 


ngs perform a tion is set should be insensitive to past 
sition 6 imf Performance for relatively low levels of past 
»pendence of RR >.;. 

E negatively. j 





É 


is 
We have been able to establish it, however, for spe- 
examples, For instance, wage is strictly increasing in 
d ability (above A) if d(p) = 'hp*, ko = 1,7 = 
hs = 1, and b = Ye 
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performance, but sensitive at relatively high 
levels of past performance. 

It is worth noting that even if the CEO’s 
wage is nonincreasing in his estimated ability, 
his overall well-being, w + (p + 1 — p)b, 
is increasing in his estimated ability. 


M. Extensions 


A. The Board Has a Preference for the 
Incumbent CEO 


It is easy to imagine that the board has a 
preference for keeping the incumbent CEO.. 
This could be a result of personal loyalty to 
the CEO— after all, many a directorship is the 
result of close ties between the CEO and the 
director (see, e.g., Mace, 1971). Alternatively, 
an incumbent CEO may take actions to en- 
trench himself. 

Let m be the additional value that an incum- 
bent CEO yields the board. If & is the board’s 
estimate of his ability, then the board will treat 
him as if his estimated ability were & + m = 
&. It follows that the results from the previous 
section continue to hold, except with & replac- 
ing &. In particular, the next proposition is an 
immediate corollary of our earlier results. 


PROPOSITION 8: As the additional value 
that the incumbent CEO yields the board, m, 
rises the following occur: 


(i) the intensity with which the current 
board monitors the CEO decreases; 
(ii) the independence of the future board de- 
. creases; and 
(iii) the minimum estimated ability for the in- 
cumbent CEO at which he will be re- 
tained prior to bargaining decreases. 


(The three results follow from Propositions 1, 
6, and 2, respectively.) 

In other words, Proposition 8 simply indicates 
that the more the board values the incumbent 


` CEO” indépéndent of his ‘ability, the less in- 


tensely he will be monitored by the board and 
the lower the standard to which he will held by - 
the board. These results are consistent with the 
widely held belief that entrenched CEOs or 
CEOs who have cultivated personal loyalty are 
less scrutinized and face lower standards. 
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To the extent m is endogenous, Proposition 8 
predicts that a CEO would undertake activities 
that raise m. An example of such an activity is 
given by Andrei Shleifer and Robert W. Vishny 
(1989). They argue that CEOs attempt to reduce 
the- probability that they will be dismissed by 
making investments that are more profitable un- 


der them than arly replacement CEO. Everr-if ~ 


such investments decrease firm value, a CEO 
has an incentive to‘make them because they raise 
his value vis-a-vis a replacement. 

Proposition 8 identifies another cost of 
entrenchment in addition to Shleifer and 
Vishny’s investment-distortion cost: The more 
entrenched the CEO is, the less intensely he is 
monitored. Consequently, the board is less likely 
to identify problem CEOs who should be dis- 
missed (even if the benefit m must be foregone), 
which further reduces expected firm value. 

This analysis also shows that a CEO is better 
off with his friends on the board (i.e., people 
for whom m is positive). A CEO is, therefore, 
likely to use whatever influence he has to put 
directors who will be loyal to him on the board 
and to ensure the loyalty of those already on 
board.'’ Given this, it is not surprising that 
boards often become interlocked (see Hallock 
for evidence). 


B. Effects of Board Action on Share Price 


How is the value of the firm affected by the 
board’s decision to fire or to retain the CEO? 
We consider the answer at two points: prior to 
when the board could obtain a signal and after 
it would have obtained a signal. Without loss 
of generality, we again assume the incumbent 
CEO offers no additional benefit to the board 
(i.e., m = 0). 

Prior to monitoring, but after the first fire/ 
retain decision, the value of the firm is!® 


U For an extreme example see Bryan Burrough and 
John Helyar’s (1990) discussion of the board of RJR- 
Nabisco. 

18 Our analysis ignores the present discounted value of 
the firm beyond the period considered by our model. This 
is slightly problematic because, as we argued in Proposi- 
tions 4 and 6, we should expect hysteresis across CEO 
regimes. Given, however, the relatively long tenure of 
CEOs (ten years on average—see, e.g., Hermalin and 
Weisbach, 1988), this future omitted part of firm value 













(16) P*V + (1 — P*)â. 


Since P* is decreasing in & (recall Propo 
1), (16) need not be increasing in â. If, howe 


the disutility-of-effort function, d(-), is cong Xi 
enough, then (16) will be increasing-in â. 
LEMMA 3: If ii 
(17) d(p) = d(1) 
| d(1) — d(0) i 
( pared 
Pa a a PY 
Jog(2) ECTP 
f ates wI 
Vp € [0, 1], ed or fir 
then (16) is increasing in â. 
FV 


We will henceforth assume that (17) holds* 

Let FV equal (16) under an incumbent CE) 
and let FV, equal (16) under a replacemer 
CEO. The probability of realizing a firs 
period profit such that the CEO is dismissed 








r+T9 


o(T ta, TTo | 
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where A, is the cutoff ability level defined if 
Proposition 2. Consequently, at the beginnitf 
of the game the firm will be worth 


To tr 
x, = 2 a. 
r 





E{ FV 

















will generally représent a very small portion of the S* 
value. . 4 
1 If d(-) is not convex enough, then (16) may be dea § 
ing in & over some range of &. To see this, suppose d(- 
affine. Then there would exist an &* such that P* =} 
â < &* and P* = 0, for & > &*. Since V > â, this w$] 
imply that (16) must decrease as & crosses â*. 


eS 
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: Aner the first realization of profits, x,, the 
frm's yalue is 


nt xEV + (1 - x)FVo, 


1 ifP*V + (1 — P*)& — kod(P*) 
> Uo 


0 otherwise 


E indicates whether the incumbent CEO is re- 
BS tained or fired. 


From Section II, subsection D, 
FV 4-4, E Kod(Paaa,) 


= FV) — kod (Po) = Us. 







a replacenieame 
lizing a fing 
is dismissed 
















Moreover, from Proposition 1, Pz-4, < Po;” 
ence, FVa-4, < FVo. There thus exists an 
interval of &’s starting at A, such that investors 
would prefer that the CEO be fired, but such 
that the board would prefer to retain the CEO. 
SA Consequently, there is a discontinuous drop in 
the value of the firm at & = A,. Since â and 
i âre monotonically related, we can conclude 











orth ee that firm value, expression (18), is increasing 
t for all values of x, except at 
+ 
x, = a Ay, 
r 
rT )) an there is a discontinuous drop in firm 
EE EG ; We summarize the analysis so far. 
; 'ROPOSITION 9: Higher first-stage profit is 
"To kè Positively related to whether the CEO keeps 





ay fis jo, But higher firm value is not monoton- 
ly related to whether..the- CEO keeps his 


profits such that investors would prefer that 


“Recall P, is the optimal intensity of monitoring for a 
nt CEO 
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. Moreover, there is a range of first-stage ` 
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the CEO be fired, while the board prefers to 
retain him. 


Remark 3: Proposition 9 suggests that earn- 
ings (i.e., xı) should be a better predictor of 
CEO turnover than share price, which is con- 
sistent with the empirical literature (see, e.g., 
Weisbach or Murphy and Zimmerman). 


Proposition 9 and the discussion preced- 
ing it indicates that a tension can exist be- 
tween investors and directors over whether 
the CEO should be fired, with the investors 
preferring to fire and the directors preferring 
to retain. This provides an explanation for 
the common phenomenon of investors seem- 
ing more eager than the board to dismiss . 
management. It can also explain why take- 
overs and other costly control contests can 
be worth mounting. 

Now we turn to the stock reaction when the 
board bases its fire/retain decision on its private 
signal. If the board fires the CEO, then the ex- 
pected value of future cash flows is zero. Prior 
to evaluation, the expected value of future cash 
flows is positive. It follows, therefore, that: 


PROPOSITION 10: The firm’s stock price 
falls if the CEO is fired on the basis of the 
board’s private information. 


Finally, suppose k is not known to inves- 
tors. Ignorance of k does not change Proposi- 
tion 10, so we will focus on what happens 
when the board fires the CEO based on public 
information. For any value of k if the board 
wants to fire the CEO, investors would also 
want the CEO fired. If we imagine a distribu- 
tion over k such that it is uncertain whether the 
board will fire the CEO for a given level of 
first-stage profits, then firing the CEO will be 
considered good news by investors and will 
cause the stock price to rise. The stock price 
will also rise because firing the CEO signals 

. „ {hat the board is relatively more independent 
than was anticipated and will, thus, monitor 
more intensely. This yields: 


PROPOSITION 11: Suppose that the board’s 
independence is unknown to investors. Then 
the firm’s stock price rises if the CEO is fired 
on the basis of public information. 
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Remark 4: Our result that the stock price re- 
action to a CEO dismissal differs depending 
on whether the board used public or private 
information is consistent with, and could 
even explain, the ambiguous relation be- 
tween CEO dismissal and stock price reac- 
tion found in event studies of CEO turnover. 


~ See Warner et al. for a survey of these stud-— 


ies and a similar explanation for their incon- 
clusive findings.’ 


IV. A Nonbargaining Interpretation 


A potential concern is the realism of the 


bargaining game and the extent to which our 
results depend on it. Hence, it is worth rein- 
terpreting the model in a way that does not 
depend on bargaining. 

In this interpretation, the timing is the same, 
except that the bargaining stage, stage 4, no 
longer exists. We also want to reinterpret stage 
5: Let 


p(V- â) + â — kd(p) 


be the firm’s expected profit; where, now, 
kd(p) is the cost of monitoring. We now in- 
terpret k to be a cost parameter known to the 
firm’s decision makers, but possibly unknown 
to investors. 

We assume, now, that a board’s level of 
monitoring, p, is an intrinsic attribute of the 
board. In particular, it is invariant with respect 
to â or other parameters. Boards that monitor 
more-—have a higher p—are more costly for 
the firm than boards that monitor less—have 
a lower p. That is, d’(p) > 0. We assume, 
additionally, that this marginal cost is also in- 
creasing in p (i.e., d’(p) > 0).”! 


2! For example, there could be benefits to having inside 
directors on the board (perhaps to groom them as potential 


_ successors to the CEO—see Richard F. Vancil [1987] on 


this point) or less-intensely-monitoring outside directors 
(perhaps because they bring needed expertise to the firm). 
These benefits are increasing in the number of such direc- 
tors, but with diminishing marginal returns. When these 
directors are seen as the opportunity cost of more- 
intensely-monitoring outside directors, we have d’ (p) > 
0 and d"(p) > 0. 




















We can now reinterpret Proposition r 
statement about board composition and te ; 
derlying parameters. 


COROLLARY 1: Under the alternative’, ee 
terpretation of this section, the level of boa E 
independence, p, is 
(i) decreasing with he prior estimate Of tk 
CEO’s ability, &; 
(ii) decreasing with the precision of 4, 
prior estimate, T, but 
(iii) increasing with the precision of the sch zigh 
nal, s. Ti 


have easier boards), while Corollary 1(i) isg 
substitute for Proposition 6 (board indep 
dence is decreasing with estimated ce i 
ability). i 


this alternative interpretation (although ron 
osition 3 has little economic pee Thy 















pendent on the existence of a bargaining stage 
Rather they are driven by combining a matt 
ing model, similar to Jovanovic’s, with endog 
enous monitoring. Bargaining enables us tE. 
address the central enigma, set forth by Bet 
and Means and others, of how a seemingly 

efficient institution has survived. In particult 
it serves to explain how, why, and when CE0R 
have.a say over who serves on the board. bi 
also serves to explain how, despite this s+ 
the board can still provide a valuable mot 
toring role. 


V. Policy Implications of the Model 


As corporate governance has remained Ë 
sentially the same since the days of nae 
Means, so too have the criticisms and $ 
posed reforms of it. For example, Lipton a r 
Lorsch call for a number of changes, includ 
a smaller board (to reduce free-riding)»!% 








utsiders, more meetings, director pay 
4 to stock performance, and appointment 
*Jead’’ director (if not the chairman) who 
‘separate from the CEO.” These policies, in 

ion and Lorsch’s view, would lead to better 
toring of the CEO. 
OWE =- the context of our model, each element 
Ef the Lipton and Lorsch proposal serves to 
HERE. Ance the k of the board. For example, stock- 
È gased incentives lower the k;’s of all directors, 
ghile replacing high-k insiders with lower-k 
natsiders lowers average k. At first glance, one 
might think that efforts to lower the board’s 
Ms through regulation or political pressure 
É would lead to more effective monitoring. 
=yfhis argument, however, ignores the nature 
the equilibrium in the model. The CEO and 
board bargain over the effective k, which takes 
account of all incentives that potential direc- 
tors will have while they are on the board. As 


a lternatiye’ 
level of bis 



































rviving CH 
llary 1 (ij 
board indep 
stimated Ñ 


‘Se long as the bargaining process is itself unaf- 
le to hold ui: fected by reforms, the equilibrium k will be 
although Pra tinle affected. 


+h follows that we need to distinguish be- 
tween those policies that will affect the bar- 
aining process and those that will not. For 
k instance, requiring a specified fraction of the 

hoard to be outsiders would result in an 
autsider-dominated board, but not necessar- 
ily one that is more independent than the 
insider-dominated board that would other- 
Wise prevail—the CEO and board members 
Will have latitude in the selection process to 
öfiset whatever benefits are created by ex- 
eEcnously imposed ‘‘independence.’’** On 
the ‘other hand, the model suggests that re- 
Guiting incentive pay for directors could 
ve an effect: By lowering ko, this require- 


ts, Propositio 
inder this 


srefore, nol 
argaining 
bining a mald 
c’s, with end iy 


n . 
. This proposal is fairly representative of the many 
teforms that have been proposed by the busi- 
press, academics, and business people themselves 
eet inate Law Institute, 1982; Jensen, 1993; John G. 
tansa a 1995). See, too, Brickley et al. (1997) for a 
-on and evidence on one of these potential reforms, 
Be eation of the CEO and chairman’s positions. 
mce an outsider is simply someone with no other 
chi Corporation, it is hard to imagine that high-k 
anges, inclu Could not be found (e.g., through interlocking 


free-riding} 
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il 


ment would affect the bargaining, leading to 
more independent boards and greater moni- 
toring (see Proposition 6). Moreover, be- 
cause of hysteresis, these benefits can 
persist—although entropy could lead them 
to diminish over time. ~~ 

Of course this analysis begs the question of 
why corporations do not voluntarily adopt ef- 


fective reforms such as this. One answer is that . 


just as the board and the CEO negotiate over 
board composition, they would also negotiate 
over the implementation of reforms. Provided 
his past success gave him sufficient bargaining 
power, the CEO would be able to block or 
blunt such reforms.” 


VI. Concluding Remarks 


A recent Harvard Business Review ‘‘Per- 
spectives Section’’ provides some insight 
into the realism of our model (Smale et al.). 
John Smale, who became the nonexecutive 
chairman of General Motors following 
Robert Stempel’s forced resignation, de- 
scribes policies adopted by the GM board 
that have dramatically improved its effec- 
tiveness. In contrast, Alan Patricof, a leading 
venture capitalist, argues: ‘‘Deep down 
[CEOs] really wish they didn’t have boards. 
That’s why, at the end of the day, most in- 
dependent directors get neutralized in one 
fashion or another (Smale et al., p. 8).’’ A 
model of corporate governance should be 
consistent with both perspectives; it should 
explain both how some boards are active 
monitors of management, yet how some 
CEOs are able to avoid scrutiny. 

By studying the determinants of board com- 
position as a bargaining process, our model is 
consistent with both active monitoring in some 
firms and CEO dominance in others. The pro- 
cess by which GM acquired a strong board is 
illustrative of the model’s logic: The company 
had a crisis induced by poor profits and the 


24 An additional prediction of our analysis is that new 
firms or those with weak CEOs (i.e., firms less subject to 
the agency problems considered here) will be the first to 
adopt reforms. 


te 
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board was forced to act. The new CEO had no 
bargaining power and, thus, had to contend 
with an active board. None of this would have 
happened had the previous managers per- 
formed better; they would have maintained 
their jobs and their control over the board. 
Subsequently, after a period of good perfor- 
mance, GM went back io a more traditional 
arrangement.of have the CEO also serve as 
chairman (Wall Street Journal, December 5, 
1995 p. B1). 

The model is consistent with a number of 
empirical regularities: CEO turnover is nega- 
tively related to performance and this relation 
is stronger when the board is more indepen- 
dent. The probability that independent direc- 
tors are added to the board increases following 
poor corporate performance. And boards tend 
to become less independent over the course of 
the CEO’s career. The model also explains 
why management turnover is more related to 
earnings than to stock returns. Finally, the 
model provides insight into the effectiveness 
of various policies designed to enhance the 
board’s monitoring. 

Despite the mode]’s consistency with exist- 
ing empirical evidence, a number of directions 
for future research remain. One is to model the 
board’s operation in greater detail. For in- 
stance, we have assumed that the board 
chooses a common intensity of monitoring, p. 
What we have not considered is how the board 
implements this choice. For instance, does p 
represent the collective output of the board 
(e.g., what it does at board meetings) or is it 
an aggregate of individual directors’ efforts 
(e.g., carefully reading reports prior to board 
meetings)? If it is the second, to what extent 
is the board able to overcome the problem of 
free-riding endemic to team production (see, 
e.g., Holmstrom, 1982)? Once free-riding 
among directors is an issue, the dynamics of 
board composition become more complicated. 
For example, the continuing directors can re- 
duce their own workloads by adding very in- 
dependent directors (i.e., low-k directors) to 
the board. This, in turn, could offset the en- 
tropy prediction of Proposition 4. 

For example, unlike most American com- 
panies, a German or Japanese company typ- 
ically has strong ties to one particular bank 
and representatives of this bank usually 



























serve on the company’s board.” These w 
resentatives presumably have a strong j 
est in the company’s well-being.’6 
diligence of the rest of the board is unci 
free-riding considerations would tend toy! 
duce their effectiveness, while the banka : 
resentatives have incentives to ensure j 
directors be selected: who will be less lika 
to free ride. Similar issues could be expen 
to arise in family-owned firms in the Unit 
States. 

One limitation of our model is that it foci 
solely on the monitoring role of boards. Ty 
institutional literature (see, e.g., Macey 
Vancil) emphasizes that boards also play iņ 
portant roles providing information and advig 
to management, and serving as a training 
ground for future CEOs. A richer model ¢ 
boards should take into account these rolesy 
well. To the extent they represent opportuni: © sequent 
costs of monitoring [make d’(p) > 0}, the! et 
other roles complement our analysis. 

Our model could also be extended to inyg} 
tigate the transition from an entrepreneuni 
firm to a managerial firm. In this transition, 
entrepreneur (or his or her venture capit 
firm) has an incentive to maximize the vale 
of the firm by minimizing the impact of ty 
entropy problem. 

A last topic for future research would kf: ia 
to consider noncorporate situations whak” 
boards play a monitoring role. For exampki 
universities, trusts, and other nonprofit inst 
tutions all have bodies that function mudg:. 
like corporate boards of directors. Much 
the analysis presented above would set: 
equally applicable to these boards, but $i 
with international comparisons further W 
is worth pursuing.” 





°5 See Kaplan (1994a, b) for recent evidence oË i 
effects of these banking relationships on corporate 
ernance in Germany and Japan. i 

% Although it should be remembered that such is 
tors are themselves agents (of the bank), which coul 
ate a second set of agency problems. Despite this, itis 
reasonable to expect these directors to be more con 
about the firm’s profits than other directors. P £ 

27 See William G. Bowen (1994) for a discuss. $: 
the differences between profit and nonprofit boards." 
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APPENDIX: PROOFS PROOF OF LEMMA 2: 

Consider bargaining with a new CEO. If this 
hi bargaining is unsuccessful, the board can hire 
: OOF OF LEMMA 1: yet another CEO. Hence, from (12), bargain- 
fe PR ing entails maximizing 


=ð + aglH  (P*V — Kod(P*) — w — Up) 











xX ((P*® + 1—- P*)b+w) 
+Z,- avoltt (note FF = 0) 

] ý ; with respect to k and w. Given the monotonic 
: of boards, PA. relationship between P* and k (Proposition 1) 
, €g., Mace =d>0 (recau fess and we can equivalently maximize this product in 
ds also play) TES P* and w. The first-order conditions are 
ation and a i 
g as a trai aaa TE Sa ' (A1 Vo — kod’ (P* 
icher modely a acer a). hee aera? 
nt these rolegy X ((P*® +1- P*)b+w) 
sent oppo tb 
(p) > 0], + (P*Vo ~ kod(P*) ~ w ~ Uo) 
alysis. av ) 
tended to iny a ee x (®-1)b=0 and 
entreprenewi 
ais transition, : 
venture capi ROOF OF PROPOSITION 1: (A2) P*V)—kod(P*) —w— Up 
imize the va Let N be the expression to be maximized in 
e impact 0 7). Consider (i), if &@ = 0, then: — ((P*6+1-—P*)b+w) <0. 
earch ere: i 82 _ ôV- 4) 
tuations WHEE gaap © ga < 0 by Lemma I, In equilibrium, P*V — kod (P*) — w = Uo— 
e. For exa miig p one new CEO yields the board the same utility 
nonprófit by the usual comparative statics, as another new CEO. It follows, then, from 


function ; SOP */0G < 0. Similarly, (A2) that w = 0. Plugging that back into (A1) 


sctors. Muc yields 
re would P- 2Q əv 
boards, DMM +- gkop — 7 (P) < 0; and 5 = ar ` Vo — Fod’ (P*) = 0 


ns further 


! ; This is the first-order condition (8). Since P * 
E os -) 4 JH is monotonic in k, the solution to the Nash 


= ( -1 + em 7G <0 bargaining game is therefore k, = ko. 
PROOF OF PROPOSITION 2: 


r where | the second result uses the fact that Let U* equal the equilibrium expected util- 

he = 0). Hence, OP*/Ok < O-and -- ity of the board if it bargains with the incum- 
Or < 0. Finally, bent CEO. Since Př-z, is the board’s most 

preferred level of monitoring, we know 





mnt-evidence.f 

red that sich fl 

k), which couldy an : əv z 

Despite this, ith ao = =e b> 0, 
ss Op Os 2(7 + s)VH (A3) U® = P#. V+- P#_z,) 


-the usual comparative statics, 2P/Os > z 
x max {0, &} — kod(P7.z,)- 
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Using the envelope theorem, it is readily 
shown that the right-hand side of (A3) is in- 
creasing in â. Moreover, as & goes to infinity, 
the right-hand side of (A3) also goes to infin- 
ity. Differentiating the right-hand side of (A3) 
with respect to 7 using the envelope honem 
yields , oe 5 7 ates 





1 | 


(A4) Pin - Tro <0 


(the option value is decreasing with the pre- 
cision with which the CEO’s ability is esti- 
mated). Suppose, now, that & = 0. It follows 
from (4) and (A4) that 


(A5) PŽ% V — kod(Př-m,) 
< PoVo — kod (Po) 
(A6) = Uo. 


Combining (A3) and (A6) establishes that 
- A, > 0. Since the right-hand side of (A3) is 
continuous and increasing without bound but 
is less than Up for'an estimated ability of 0, it 
follows that A, exists and is unique. 


PROOF OF PROPOSITION 3: 

In the proof of Proposition 2, we established 
that 
— A.) + Ac 


AD. PRax (Vana 


— kod(P#-%) — Uo = 0 
for all ko. Since (A7) holds for all ko, it is an 
identity. Differentiating (A7) with respect to 
ko using the envelope theorem yields 


(A8) [P*6+1 7 





0 


— d(P#_z,) + d(Py) = 0. 


From Proposition 1, PŽ =k) < Po. Hence, since 
d(-) is an increasing function, it follows from 
(A8) that 0A,/Oky < 0. 












limited-liability constraint is ane (ws & : 
and (ii) it is not binding. Begin with case( 
From Proposition 1, the probability that 
CEO is monitored is monotonic in k, at 
maximizing- (12) with respect k is equiva 
to maximizing ; 


(A9) (pV(â,7) + (1 — p)& — kod(p) ~ wh 


X (p®+1—-p) 


with respect to p (since the CEO will be 
tained, we know ĉ > 0). Define W to equi’ 
(A9). By well-known comparative statics p% 
sults, it is sufficient to show 0°/ddap 5 
negative: 


Ow p 4 t4 
Jap = P? 1)?+2(6-1) Fo 
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Using the first-order condition for (A9) i 
(A10) can be rewritten as 


au 
O&Op 





= 2p(® — 1)? + 2(6 — 1) ; last exp; 
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0a 1- l 


Hence, we have 
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and use (6) to rewrite the right-hand side of 
11) as 


ABl) — 1)? + 2(@(z) — 1) 
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implifying, this reduces to 


(p) ~ Up 13) 2(@(z) — 1)? + 2((z) — 1) 
r s $(z)? 
ion for zo(z) + fo" 


traightforward calculations reveal that this 
st Expression is negative for all z = 0 (and, 
etce, for all & = 0). So the right-hand side 
ef (All) is negative, which, from (A11), 


Kkod'(p) $ 
- 3s 
ðĉðp ~~ 


oad te ee fee 


N level of monitori i ‘ 
+2(¢ - DEE na : monitoring satisfies (10): 
. V- â- kod' (p) - (1 - 6)b = 0. 
SLN be the left-hand side of (A14). The 
llows if 89/88 < 0. 
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Ow consider case (ii). From Proposition ` 
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AQ ðP 
Jå =%-1+b3 
o, BBV ~ &— Kod’ (p) 
(A15) = 1+ TEE 
ôV- å 
<t-1+ Z 


where (A15) follows from (A14). Using the 
z-transformation, we have 


an p(z)? 
oo < (z) — 1 = zolz) +: 
re eet arpa re 
Straightforward calculations reveal that this 
last expression is negative for all z = 0 (and, 
hence, for all @ = 0). 


PROOF OF PROPOSITION 7: 
Consider A first. From (14), w = 0 if 


[p(V — â) + & — kod(p) — Up] 
- [p®b + (1 —p)b] < 0. 


At & = A,, the first bracketed term is zero, 

while the second is strictly positive. The ex- 

istence of A then follows from continuity. 
Turn to A. If w > 0, then 


2w = [p(V — â) + & — kod(p) —- Uo} 


os pee + (1 — p)b]. 


The first bracketed term increases without 
limit in â, while the second has an upper 
bound of b. It follows then that beyond a cer- 
tain level, A, that w must be increasing in â. 


PROOF OF LEMMA 3: 
From Proposition 2, we know â = A, > 0 or 
â = Q. Differentiate (16) with respect to â: 


`" ap* 
1+ P*(@-—1)+(V-&)—~. 
d& 





From (8), this can be rewritten as 





= dP* 
1+ P*(@ — 1) + kd’(P*) a 
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Also from (8), it is readily shown that 


dP* ®-1 
dê  kd"(P*)" 


So (16) is increasing in â if 





(A16) 1+(8-— 1)( Ps $ TEn = 


d"(P*) 


Since & = 0, @ = '/, so (A16) holds if 


d'(P*) 
d"(P*) ` 


Define d(p) to equal the right-hand side of 
(17). It is readily shown that 


pgs: 


a"(p) 


so (A17) follows if d’ (p)/d"(p) = d'(p)/ 
d"(p). By (17), d(-) is more convex than 
d(-). By adapting a well-known result on 
when one individual will have a greater 
Asrow-Pratt measure of absolute risk aversion 
than another individual (see, e.g., Chi-fu 
Huang and Robert H. Litzenberger, 1988 p. 
29), it readily follows that d’ (p)/a"(p) = 
d’(p)/d"(p) for all p € [0, 1]. 


2 => P*¥ + 





(A17) 
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L. and Wn The world of the late twentieth century is 
s and T Š much more heavily populated and has much 
irnal of Fin Š higher average living standards than any pre- 
(March 1 vious period in-human history. However, 


Awidely publicized concerns have been ex- 
ed over whether per capita real income 
continue to increase or even be maintained 


Model of Renewable Resource Use 
By JAMES A. BRANDER AND M. Scott TAYLOR * . ks 


This paper presents a general equilibrium model of renewable resource and 
population dynamics related to the Lotka-Volterra predator-prey model, with 
: man as the predator and the resource base as the prey. We apply the model to 
ha}. > the rise and fall of Easter Island, showing that plausible parameter values gen- 
‘erate a ‘feast and famine’’ pattern of cyclical adjustment in population and 
<" resource stocks. Near-monotonic adjustment arises for higher values of a re- 
source regeneration parameter, as might apply elsewhere in Polynesia. We also 
. describe other civilizations that might have declined because of population over- 
shooting and endogenous resource degradation. (JEL Q20, N57, J10) 


however, producing interesting new informa- 
tion on the role of natural resource degrada- 
tion. Specifically, a pattern of economic and 
population growth, resource degradation, and 
subsequent economic decline appears more 
common than previously thought. A major 


` question of present-day resource management 


me and Sh èt current levels in the face of rapid population ` 
owent Bevowth and environmental degradation. Econ- 

Wa mnl normally tend to be skeptical about 
: Directors! y 




















h claims, largely on the basis of the histor- 


fave worried about overpopulation and envi- 
ental degradation, yet the past, at least as 
Hagine it, seems to provide a record of 
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Ogical and anthropological evidence is, 
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is whether the world as a whole, or some por- 
tion of it, might be on a trajectory of this type. 
A first step in addressing such concerns is to 
construct a formal model linking population 
dynamics and renewable resource dynamics. 
The primary objective of this paper is to 
construct such a model. The second objective 
is to apply this model to the very interesting 


case of Easter Island which, until recently, has’ 


been one of the world’s great anthropological 
mysteries. Our model contains three central 
components. The first component is Malthu- 
sian population dynamics, following Thomas 
R. Malthus (1798). Malthus argued that in- 
creases in real income arising from productiv- 
ity improvements (or other sources) would 
tend to cause population growth, leading to 
erosion and perhaps full dissipation of income 
gains. He also suggested that population 
growth might overshoot productivity gains, 
causing subsequent painful readjustment.' 


! Malthus was not a fatalist. He believed that enlight- 
ened public policy could reduce population growth, con- 
templating both contraception and ‘moral restraint’ as 
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The second component of the model is an 
open-access renewable resource. Malthus as- 
serted (Ch. 10 pp. 82-83) that the negative 
effects of population growth would be worse 
in the absence of established property rights. 
Property rights are a particular problem with 
renewablé resources such as fish, forests, soil, 
and wildlife. Thus, if we are to observe Mal- 
thusian effects anywhere, we are perhaps like- 
liest to see them where renewable resources 
are an important part of the resource base. The 
extreme version of incomplete property rights 
is an open-access resource, where anyone can 
use the resource stock freely. Malthus did not 
provide a clear formulation of open-access re- 
newable resources, but we can take this step, 
drawing on the modern theory of renewable 
resources. 

The third component of the model is a Ri- 
cardian production structure at each moment 
in time. Thus the model might reasonably be 
referred to as a Ricardo-Malthus model of 
open-access renewable resources. The com- 
ponents of the model are relatively simple. 
Even so, the model exhibits complex dynamic 
behavior. For example, one possible outcome 
of the model is a dynamic pattern in which, 
starting from some initial state, population and 
the resource stock rise and fall in damped cy- 
cles. A change in parameters, however, can 
shift dynamic behavior toward monotonic ex- 
tinction of the population or might lead to 
monotonic convergence toward an interior 
steady state. 

The model provides a plausible explanation 
of the rise and fall of the Easter Island civili- 
zation. Applying the model to larger and more 
complex modern resource systems would re- 
quire an expanded model structure, but the 
simple model presented in this paper provides, 
at the very least, insights that should be con- 
sidered in evaluating current renewable re- 
source management practices. 


mitigating factors. He noted that reduced fertility arising 
from social responses (such as increased age of marriage) 
was the major check on population in Western Europe. 
Other checks included reduced fertility and increased mor- 
tality arising from poor nutrition and increased incidence 
of disease. Famine, in his view, was only nature’s last 
resort, and he noted that in much of Europe ‘‘absolute 
famine has never been known’’ (Malthus, 1798 p. 61). 


tion VI discusses the role of institu 

















(1817), and the pioneering work on renew; ses E 
resources of H. Scott Gordon (1954) an fas 
M. B. Schaefer (1957). Resource dynam 
have been studied by many scholars, and apy 
ticularly valuable ‘overview of this Mater’ rary 
(with considerable original work) is Colinwky 
Clark (1990). The particular resource mog ara 
used here is due to Brander and Taylif gs 


(1997), and is also related to Anthony pf as : 
Scott and Clive Southey (1969). Detailyi’ gegnological 
modeling and estimation of particular reng civiliz. 
able resource stocks has been carried out by pean dis 
many scholars including, for example, Jeg much | 
Didier Opsomer and Jon M. Conrad (199) it had b 
Careful studies of Malthusian population ¢f-Thus'the eco: 
namics include Maw-Lin Lee and David}fFgne'of rising ` 
Loschky (1987) and George R. Boyer (198i tiwed by dec 
The claim that environmental constraints g g most v 


and Lester R. Brown (1995). 
Applying formal economic analysis to sé 
archaeological mystery is an unusual activi 


In particular, Vernon L. Smith (1975) usdi 
formal model of hunting to explain the extinf:ton 
tions of large mammals during the late Plag.-p 

















explain human prehistory more generally. 15 
idea parallels evolution in the field of archkg’ ppor 
ology itself, where mathematical models% Hates 
increasingly used as aids to interpreting piji 
ical evidence. A valuable overview of this4-M0y 


Section I provides a brief descriptiod 
Easter Island’s past. This past is fascinatltg'y 
its own right, as are the methods by wh %4 
has been uncovered, but our main goal 83: 
provide background to our approach. Sei 
II presents our general equilibrium moc? $; 
resource use and Malthusian population %4 
namics. Section MI analyzes populations% 
resource interactions, and states the £ 
propositions characterizing the dynamica 
havior of the system. Section IV appt” 
model to Easter Island and Section V des 
other cases where the model might apply: 
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— Section VII contains concluding 


I. Easter Island 


and (also called Rapa Nui) is a 
par aR over 2,000 miles (3,200 
from the coast of Chile, with a population 
of the early 1990’s) of about 2,100. For 
past two centuries, Easter Island has been 
i ted as a major archaeological and an- 
fitropo -ological mystery. In particular, the Poly- 
fj Se zation in place at the time of first 
= qpean discovery in 1722 is known to have 
Keen much poorer and much less populous 
dan it had been a few hundred years earlier. 
3 the economic record in Easter Island is 
2 of rising wealth and rising population, fol- 
flowed by decline. i 
$! The most visible evidence of Easter Island’ s 
g standards gast glory consists of enormous statues (called 
orgaard (19MF‘moaj"’), carved from volcanic stone. Many 
Wie aatues rested on large platforms at various lo- 


psource moai 
r and Ta b 
D Anthony aes 
69) . Detal 
icular ci 
carried ou 
example, fa 
onrad (1994 
population & 
+ and Davidi 
, Boyer (198% 
constraints 












































analysis canons on the island. The largest ‘‘movable’’ 
unusual act tues weigh more than 80 tons, and the larg- 
hout precedi@™eat statue of all lies unfinished in the quarry 
1 (1975) ustaehere it was carved, and weighs about 270 
plain the exiagiminas The puzzling. feature of the statues and 
g the late Pgmifatforms is that the late stone age Polynesian 
+ (Smith, ‘$@itulture found on Easter Island in 1722 seemed 
omic model, ble of creating such monumental archi- 
e generally: . First, the culture seemed too poor to 
: field of aft oft a large artisan class devoted to carving 
tical modél tues, and certainly no such group existed in 
iterpreting H teenth century. Also, the statues were 


éd substantial distances from the island’s 
quarry to their destinations, but the pop- 
On, estimated at about 3,000 in 1722, 
hed too small to move the larger statues,” 

without tools such as levers, rollers, 
and wooden sleds. However, the island 
22 had no trees suitable for making such 


view of thisi 
nneth RUF 


f descriptio 
tis fascinatiiy 


{0 move the-statues, and believed that the 


isa significant literature devoted to methods of 


the dyna i 
yn onstruction and movement. Even 3,000 peo- 


m Iv vies Rot have moved the larger statues without the 

>chon ert Wooden sleds and levers. See Paul Bahn and 

might apply; axey (1992) and Jo Anne Van Tilburg (1994) for 
f statue transportation. ed 


: of instituy 


Local residents had no knowledge of - 
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statues had walked to the platforms under the 
influence of a spiritual power. 

Various theories have been advanced to ex- 
plain these statues and other aspects of Easter 
Island. The most well-known theory is due to 
Thor Heyerdahl (1950, 1989), who argued 
that native South Americans had inhabited 
Easter Island (and other Pacific islands), had 
built Easter Island’s statues, and had subse- 
quently been displaced by a less advanced 
Polynesian culture. To support his thesis, 
Heyerdahl traveled on a balsa raft, the Kon- 
Tiki, from off the coast of South America to 
the Pacific islands. A more exotic theory of the 
‘‘Atlantis’’ type, proposed by John Macmillan 
Brown (1924), is that Easter Island is the tiny 
remnant of a great continent or archipelago 
(sometimes called ‘‘Mu’’) that housed an ad- 
vanced civilization but sunk beneath the 
ocean. A still more exotic theory, proposed by 
Erich von Daniken (1970), is that the statues 
were created by an extraterrestrial civili- 
zation. Two recent books describing the cur- 
rent understanding of Easter Island are Bahn 
and Flenley (1992) and Van Tilburg (1994). 
This understanding does not support the 
Heyerdahl, ‘‘Atlantis,’’ or extraterrestrial 
theories of Easter Island, but fits well with 
the Ricardo-Malthus model of open-access 
resources. 

Recently discovered evidence suggests that 
Easter Island was first settled by a small group 
of Polynesians about or shortly after 400 A.D., 
The pollen record obtained from core samples - 
and dated with carbon dating methods shows 
that the island supported a great palm forest at 
this time. This discovery was a major surprise 
given the treeless nature of the island at the 
time of first European contact. In the years fol- 
lowing initial settlement, one important activ- 
ity was cutting down trees, making canoes, 
and catching fish. Thus the archaeological rec- 
ord shows a high density of fish bones during 
this early period. Wood was also used to make 
tools and for firewood, and the forest was a 
nesting place for birds that the islanders also 
ate. The population grew rapidly and was 
wealthy in the sense that meeting subsistence 
requirements would have been relatively easy, 
leaving ample time to devote to other activities 
including, as time went on, carving and mov- 
ing statues. 
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Noticeable forest reduction is evident in the 
pollen record by about 900 A.D. Most of the 
statues were carved between about 1100 and 
1500.* By about 1400 the palm forest was en- 
tirely gone. Diet changed for the worse as for- 


est depletion became severe, containing less _ 


fish (and thus less protein) than earlier. Loss 
of forest cover also led to reduced water re- 
tention in the soil and to soil erosion, causing 
lower agricultural yields. Population probably 

_ peaked at about 10,000 sometime around 1400 
A.D., then began to decline.* The period 1400 
to 1500 was a period of falling food consump- 
tion and initially active, but subsequently de- 
clining, carving activity. 

Carving had apparently ceased by 1500. At 
about this time, a new tool called a ‘‘mataa’’ 
enters the archaeological record. This tool re- 
sembies a spearhead or dagger and is almost 
certainly a weapon. In addition, many island- 


ers began inhabiting caves and fortified 


dwellings. There is also strong evidence of 
cannibalism at this time. The natural inference 
is that the island entered a period of violent 
internecine conflict. However, at first Euro- 
pean contact in 1722 no obvious signs of war- 
fare were noted. This visit (by three Dutch 
ships) lasted only a single day, however, and 
much may have gone unnoticed. 

The next known contact with the outside 
world was a brief visit from a Spanish ship in 
1770, followed in 1774 by a visit from James 
Cook, who provided a systematic description? 
of Easter Island. There had been some change 
between 1722 and 1774: Most noticeably, al- 
most all of the statues had been knocked over, 
whereas many had been standing in 1722. 
Statue worship, still in place in 1722, had dis- 
appeared by 1774. Population was apparently 


* Radiocarbon dates are available in Van Tilburg (1994 
p. 33). These dates are attributed to unpublished work of 
W. S. Ayres. 

* This population estimate appears several places in the 
literature and is often attributed to W. Mulloy. Others have 
suggested that the reasonable range for the population 
maximum was between 7,000 and 20,000, with most fa- 
voring about 10,000. 

5 Cook had a Tahitian crew member who could com- 
municate quite easily with the Easter Islanders, as Tahitian 
and the Easter Island language were similar. See P. V. 
Kirch (1984 Chs. 3 and 11). 
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less numerous than it had been in 1722 o 
was estimated at about 2,000.6 

Thus Easter Island suffered a sharp dca 
after perhaps a thousand years of apparel gus f 


sequently fell in tandem with disintegrationg fihi is pe 
the existing social order and a rise in viola . form 
conflict. Kirch (1984 p. 264) suggests th Zaned en\ 
‘‘Easter Island is a story of a society which 
temporarily but brilliantly Surpassing j 
limits—crashed devastatingly.” 

The mystery of Easter Island’s fall is pegth 
garded by many as solved. In simple fomf goss 
the current explanation is that the island's R 
degraded their environment to the point tà 
it could no longer support the population s$ san) 1 
culture it once had. However, Polynesians te The 
most always dramatically altered the envir goods. 
ments of the islands they discovered. Whyég soure 
environmental degradation lead to populatk E 
overshooting and decline on Easter Islan 
but not on the other major islands of Po} the 
nesia? Furthermore, there are 12 so-calhg küp 

‘‘mystery islands” in Polynesia. Theses ess), wh 
lands were once settled by Polynesians b Keusing, 
were unoccupied at the time of European Œ 
covery. All these Polynesian islands repress 7000 & 
pieces of ‘‘data’’ that should be consist taim 
with whatever theory is proposed as an6 is 
planation for Easter Island. rE 


























II. The Ricardo-Malthus Model 
A. Renewable Resource a 


The resource stock at time f¢ is KON: ; 
Easter Island it is convenient to think of eat 
resource stock as the ecological complex Ry 
sisting of the forest and soil. The change H 
stock at time ¢ is the natural growth, 





é In 1862 the population was reliably estimated f: pee 
3,000. In 1862 and 1863, slave traders from Peru od tibe 
the island and took about one-third of the popula 
slaves. Many of these slaves died of smallpox. Ie 
turned to the island, inadvertently causing a smallp®, Ta 
idemic that killed most of the remaining Islanders,” 
the population reached its low point of 111, fronit 
it subsequently increased by natural increase a! 
migration from Tahiti and Chile. 














































), minus the harvest rate, H(t). Drop- 
time argument for convenience, 


Gg(S(O 


sg the 


a sharp 

ars Of apps dS/dt = G(S) — H. 

ilation roseg . 

: level and? è use the logistic functional form’ for G, 
lisintegratigame sich is perhaps the simplest plausible func- 
a rise in yas, form for biological growth in a con- 


grained environment. 


ociety whi 
surpassing G(S) = rS(1 — S/K). 
ind’s fall i : ‘the “carrying capacity,” is the maximum 


g 
2. 

5 
o, 
© 

ei 


ties that further growth cannot occur. Vari- 
o the poinfaiBable r is the ‘‘intrinsic’’ growth (or regenera- 
population} ) rate. 

Polynesiangg@e’ The economy produces and consumes two 
red the envingitssods, H is the harvest of the renewable re- 


rvered. Why į: grce, and M is some aggregate ‘‘other 


ad to populaj Seaod,"’ In the case of Easter Island, we think 
| Easter Islas the (broadly defined) harvest as being food 
slands of Polit(ie., agricultural output from the soil and fish 
re 12 so-calgeaught from wooden vessels made from 
esia. These fmtrees), while good M would include tools, 
Polynesians bq housing, artistic output (including monumen- 
f Europear@me! architecture), household production, etc. 
slands repré Good M is treated as a numeraire whose price 
d be consis ormalized to 1. Aside from resource stock 


EJ 


osed as arë 
BR 
de 


3, the only other factor of production is labor, 
£, We make the inessential simplification that 
weeebor force L is equal to the population. Good 
sof is produced with constant returns to scale 
Eng only labor. By choice of units, one unit 
bor produces one unit of good M. Since 
price of good M is 1, the wage (denoted 
y'w) must equal 1 if manufactures are 
produced, 
al complex We assume that harvesting of the resource 
he change if “attied out according to the Schaefer har- 
zal growth, “sting production function (proposed by 
aefer [1957]) as follows, 


H’ = eSLy 


` ki H í 

i dati re YP : ; 
ee iNest is the harvest supplied by producers. 
of the popula% eo P stands for “‘production’’.) ` 
smallpox. A‘ € labor used in resource harvesting and 


ising a smallp 
ng Islanders: ; 
of 111, from ; 
increase and Si * Major results can be extended to the case of a 


Sempensatory growth function. See Appendix B. 


ble size for the resource stock, as S = K. 


“13) = 
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@ is a positive constant. Letting a,,,(S) rep- 
resent the unit labor requirement in the re- 
source sector, (3) implies that az,(S) = Ly/ 
H?” = 1/(a@S). Production in both sectors is 
carried out under conditions of free entry. Be- 
cause of open access there is no explicit rental 
cost for using S, and the price of the resource 
good must equal its unit cost of production. 


(4) p= Wā = w/(asS). 


A representative consumer is endowed with 
one unit of labor and is assumed to have in- 
stantaneous utility given by the following 
Cobb-Douglas utility function 


(5) u = hm! ~? 


where h and m are individual consumption of 
the resource good and of manufactures, and 6 
is between 0 and 1. Maximizing utility at a 
point in time subject to instantaneous budget 
constraint ph + m = w yields h = wĝ8/p and 
m = w(1 — £8). Since total domestic demand 
is L times individual demand, we have 

(6) HP = wßLlip; M? =w(1 —- B)L 
where superscript D represents demand. 

At a moment in time the resource stock is 
fixed, the population (and labor force) is 
fixed, and the economy’s production possibil- 
ity frontier is given by the following full- 
employment condition. 


(7) RB’”a(S) +M =L. 


A linear production structure of this type is 
referred to as a Ricardian production structure 
(after Ricardo, 1817). The Ricardian tempo- 
rary equilibrium can be determined algebrai- 
cally by substituting (4) into (6) (i.e., the 
supply price equals the demand price) to ob- 
tain temporary equilibrium resource harvest, 


H = afpLsS. 


The equilibrium output of M is M = (1 — 
B)L, implying that manufactures will always 
be produced and therefore that wage w = 1. 
At a Ricardian temporary equilibrium, the 
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Panel A: Resource Dynamics 
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(Growth Function) N 







Resource Stock, S 


Panel B: Population Dynamics 


b-d+F = b-d + þaßS 


(dL/dt)/L (net fertility) 






0 





Resource Stock, S 


b-d 


FIGURE 1. A RICARDO-MALTHUS STEADY STATE 


harvest will not necessarily equal the under- 
lying biological growth rate of the resource. If, 
for example, temporary equilibrium harvest H 
exceeds biological growth G, then the stock 
diminishes. Substituting (2) and (8) into (1) 
yields the following expression for the evolu- 
tion of the resource stock. 


(9)  dS/dt = rS(1 — SIK) — @BLs. 


If the resource stock falls, then labor produc- 
tivity in the resource sector falls, the Ricardian 
production possibility frontier shifts in, and 
this establishes a new temporary equilibrium 
with a lower harvest. 
_ Panel A of Figure 1 illustrates a typical 
steady state (i.e., where dS/dt = 0) using the 
harvest function (8) and the resource growth 
function (2). (Ignore the dashed lines for 
now.) As shown in the figure, stock S* implies 
that harvest H = afLS* just matches resource 
growth G(S) at S = S*. Thus S* is a steady 
state if the actual population level is L. 


` sumean underlying proportional birth r 




























B. Malthusian Population Dynamijcy- 


Our discussion of Figure 1 so far imp 
treats population as fixed at size L, alloy: My 
us to focus purely on resource dynamics ym 
now consider population dynamics. We, 


and an underlying proportional death rate (K9 
Thus the base rate of population increas 
(b — d), which we assume to be negative 
plying that without any forest stock or ary? 
soil the population would eventually dig 
pear. However, consumption of the resopg pty), 
good increases fertility and/or decreases mieit 
tality, and therefore increases the rate of w 
ulation growth." In particular, the populaġ 
growth rate is given by 


pated isac 


Sady state 


(10) dLidt = L[b —d + F] 
where F = $H/L is the fertility function, 
¢ is a positive constant. Thus higher per cap 
consumption of the resource good leads 
higher population growth. It is in this s% 
that population dynamics are ‘‘Malthusian’®: 
Noting from (8) that H/L = aBS, equi 
(10) can be rewritten as 


(11) dL/dt = L(b — d + baBS). 


Equations (9) and (11) forma two-equal : 
system of differential equations character 
the evolution of the Ricardo-Malthus me 
These equations are a variation of the LOW: 
Volterra predator-prey model.'° Human pag if. 
ulation, L, is the ‘‘predator’’ and the reso 


3 One might let net fertility depend on total con® 
tion rather than on resource consumption, althov 
case could be argued either way and has little eH S nP. 
the analysis. ërior 

° Among modern high-income socjeties, pop 
growth is negatively correlated with income at 
country and individual level. Most premodem S- 
appear to exhibit Malthusian population dynamics en 
higher consumption causes higher population go% ; 

10 Predator-prey systems have sometimes bee? 
in renewable resource economics. See, in particulat 
L. Ragozin and Gardner Brown (1985) and Philp 
(1990). 




































§, is the ‘‘prey.’” We concentrate first 
<qeady-state analysis, then turn to the sys- 
s dynamic behavior. 


A. Steady-State Analysis 


The system given by (9) and (11) will have 
a steady state if dL/dt and dS/dt are simulta- 
ously equal to zero. There are three solu- 
ons, as expressed in Proposition 1. 


PROPOSITION 1: The Ricardo-Malthus 
adel exhibits three steady states. Steady state 
‘isa corner solution at (L = 0, S = 0). Steady 
tate 2 is a corner solution at (L = 0, S = K). 
ady state 3 is an interior solution at 


E oD L= (7/08) 
ae x [1 — (d — b)/(¢aBK)], 
S = (d—b)/(ba8). 


PROOF: 

The proof follows immediately from simul- 
taneously setting equations (9) and (11) to 
i$: zro and can be seen by inspection. To obtain 
p steady state 3, one can solve for S from ex- 
i pression (11), then substitute this value in (9) 
obtain L. Note that steady state 3 implies 
ditive values for L and S as d > b,-and r, 
and @ are all positive. The steady-state 
ck, S, must be less than carrying capacity 
‘Steady state 3, S = (d — b)/(daB) so 
sistency requirement can be written as 


a two-equiz 

charactena 
Malthus m0 
n of the 1% 
(d — b)/(haB) < K. 


}) comes to equality or reverses direction, 
Steady state 3 collapses to steady state 2. 


focus on the interior steady state. As 
BS im Proposition 4, when (13) holds, the 
ren por j Steady state is stable whereas steady 
income at bis and 2 are saddlepoints. The interior 
premodern SAN =- State can be illustrated graphically by 
on dynamics © e two panels of Figure 1. Panel A rep- 

€ resource dynamics conditional on 
ee L. Panel B captures the population 
sxva Siven in (11) by graphing the per- 
: of change of the. population (dL/ 
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dt)/L on the vertical axis, and the resource 
stock on the horizontal. The upward-sloping 
line is net fertility (b — d + F), which is linear 
in S as F = aps [substituting (8) into the 
definition of F]. 

At any stock S < S*} there is an excess of 
deaths over births and the population shrinks; 
at any stock greater than S* the population 
grows. At stock $* population growth is zero 
and the population level is stationary. We de- 
note this level as L*. Therefore, at resource 
stock S*, the population is stabilized at L* and 
the harvest of the resource is just equal to its 
underlying growth rate, implying that S is also 
stationary. The two panels together therefore 
illustrate an interior steady state. Using ex- 
pression (12), or Figure 1, we can determine 
how changes in exogenous parameters affect 
interior steady-state resource stocks and pop- 
ulation. Proposition 2 follows by inspection, 
and Proposition 3 is obtained by differentiat- 
ing L as given in (12). 


PROPOSITION 2: The steady-state resource 
stock 


(i) rises if the mortality rate rises, the birth 
rate falls, or fertility responsiveness 
falls; 

(ii) falls if there is technological progress in 
harvesting; and 

(iii) is unaffected by changes in the intrinsic 
resource regeneration rate, r, or carry- 
ing capacity, K. 


PROPOSITION 3: The steady-state popula- 
tion level 


(i) rises equiproportionately with an in- 
crease in the intrinsic rate of resource 
growth, r; 

(ii) falls when harvesting technology im- 
proves if S < K/2 and rises if S > K/2; 

(iii) falls when the taste for the resource 
good rises if S < K/2 and rises if S*> 
K/2; and 

(iv) rises if the carrying capacity of the en- 
vironment rises. 


It follows from (8) and (12) that per capita 
steady-state resource consumption, h, is (d — 
b)/ġ. Per capita consumption of the other 
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good is (1 — #). Thus steady-state resource 
consumption is determined by demographics, 
and rises if the birth rate. falls or if fertility 
responsiveness falls.’' It is instructive to see 
how population growth dissipates gains from 
_ resource productivity improvements. Suppose 
the economy is at the steady state in Figure 1 
and r rises. The growth curve will shift up as 
shown by the dashed line in Panel A. At stock 
S*, the harvest is then less than resource 
growth and the resource rebuilds. Per capita 
consumption of the resource good rises, caus- 
ing population growth, and the harvest func- 
tion in Panel A rotates upwards (as L rises). 
Since nothing has altered the demographic 
steady state in Panel B, the resource stock must 
return to $*, but with a higher population and 
unchanged per capita real income. 


B. Dynamics 


We now characterize the dynamic behav- 
iour of the system. We assume that parameter 
restriction (13) is met, implying that an inte- 
rior steady state exists. 


PROPOSITION 4: When an interior steady 
state exists, the local behavior of the system is 
as follows. 


(i) Steady state 1 (L=0, S=0) is an unstable 
saddlepoint allowing an approach along 
the S = 0 axis. 

(ii) Steady state 2 (L = 0, S = K) is an un- 
stable saddlepoint allowing an approach 
along the L = 0 axis. 

(iii) Steady state 3 (L > 0, S > 0) is a stable 
steady state and a “‘spiral node’’ with 
cyclical convergence if 


r(d — b)i (Kab) 
+ 4((d — b) — KbaB) < 0. 


(14) 


H Another easily derived point of interest is that a de- 
cline in the birth rate causes steady-state resource output 
to rise if S < K/2. This possibility is unusual for Malthu- 
sian models, but arises here because of the resource 
dynamics. 


„sumption (represented by ġ) or if che h 
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(iv) Steady state 3 is a stable steady state 
an “improper node?’ allowing mo 


PROOF: 
See Appendix A. 


Proposition 4 is based on the fact that Gy: 
dynamic system is locally linear in the neidfi 
borhood of a steady state. Since Proposition it 
shows that the interior steady state is localf üs follov 
stable, our earlier comparative steady-state ie. 
ercises (Propositions 2 and 3) are meaning 
in that small perturbations i in parameters 4 ie 
lead to small changes in steady-state vag. =: 
and allow convergence to a new steady st#&- 

Condition (14) i is central in understandgg 
the model’s local dynamics. It can be rewritg, 
as r < 4KbaB(KéaB — (d — b))/(d - te 
Noting from (13) that the right-hand sid? 
this inequality must be positive, one intep 
tation of (14) is that a slow enough rately 
intrinsic resource growth insures a locally f 
clical trajectory. Conversely, given r, (14) 
dicates that cyclical dynamics will ocot 
fertility is very responsive to per capita & 


vesting technology is very efficient (i.e. 
is high). 

To examine the global properties of 
model, consider the phase diagram in 
2. Population L is on the horizontal aži 
resource stock S is on the vertical axis "a 
horizontal line labeled dL/dt = 0 derives 






















- fine, then dL/at > 0, and if the system is 
ool it, then dL/dt < 0. The other line, la- 
vel dS/dt = 0, is obtained from expression 
which implies that dS/drt = 0 if S = K — 
pO /r)L. Above this line, dS/dt < 0 and 
eS ue interior steady state. The directions 
motion for each of the four regions in the 
a are shown by the right-angle arrows. 
: Consider, for example, point A, which 
‘might represent ‘first arrival’’ (1.e., a small 
‘population and the resource at carrying capac- 
gy). Point A is above the dS/dt = 0 line, im- 
‘al ing that the resource stock must be falling, 
and is also above the dL/dt = 0 line, implying 
rising population. The figure shows one pos- 
sible adjustment path toward the steady state, 
bat other types of adjustment are also consis- 
tent with the arrows of motion, including 
f monotonic adjustment toward the steady state. 
É Proposition 5 characterizes the global ap- 
ei ‘proach to steady state conditional on different 
i ¥: starting points. 
iH PROPOSITION 5: When an interior steady 
Propositio: state exists, the global behavior of the system 
tate is loch& is as follows. 
eady-state ái; 
ing (1) If L > 0 and S = Q, the system ap- 
i proaches steady state 1 with L = 0 and 










y-state vali 
steady sta 
understandil ae: 






(il) If L = 0 and S > O, the system ap- 
‘proaches steady state 2 with S = K and 






s a locally 
ven 7, ( 14)8 
' will occu 
jer capita Comm 
or if the ha 








Itis striking that the system converges to an 
terior steady state from any interior starting 
t. There are two important parameter re- 








a 
perties off 


BS Wality (13). If this inequality is not satis- 
ram in Be : 


system crashes toward zero popula- 











“y Stable improper node. Thus, if (13) is 
‘shed, our model implies extinction of 


dS/dt > 0. The intersection of these ~- 


Stictions underlying this property. Thé first is” 


site sy 
“aan this case, steady state 2 becomes a - 
glotal LNA 
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the human population and a restoration of the ` 


resource base to carrying capacity. 

The second parameter condition is given by 
(14) which, as shown in Appendix A, deter- 
mines whether the linearized system in the 
neighborhood of the interior steady state has 
complex or real roots. If the linearized system 
has complex roots, then all trajectories exhibit 
cyclical adjustment sufficiently close to the 
steady state. If the linearized system has real 
roots, then all trajectories approach the interior 
steady state along a path increasingly close to 
the dominant eigenvector of the system. Such 
a trajectory may be globally monotonic and 
must be locally monotonic. 

If we start far away from the steady state, 
then it is more difficult to describe the paths 
of the system completely, but many qualitative 
features of the global system follow from an 
understanding of the local analysis. For ex- 
ample, suppose the system is perturbed from 
an initial steady state by the instantaneous dis- 
appearance of some fraction of the predator 
population. Cyclical behavior arises if the 
predator grows quickly in response to this 
shock while the resource grows slowly. In this 
case, the quick growth of the predator causes 
it to overshoot its new long-run level. The now 
overabundant predator then reduces the prey 
below its steady-state level, and this in turn 
causes a decline of the predator population be- 
low its steady-state level. But when the pred- 
ator declines, the prey rebuilds and overshoots 
the steady state, leading to a resurgence of the 
‘predator, which again overshoots, etc., tracing 

out a damped cycle with an overshooting pred- 
ator population chasing a slowly adjusting 
prey toward the steady state. This interpreta- 
tion is consistent with condition (14), which 
shows that adjustment must be cyclical if r 
(the intrinsic growth rate of the prey) is suf- 
ficiently low or if ap (the growth response 
of the predator to a change in the resource 
stock S$) is sufficiently high. 

This description describes the forces af- 
fecting the local behavior of the system near 
a steady ‘state, but it also applies to global 
behavior. A complete analytical characteri- 
zation of all possible trajectories as a func- 
tion of parameters is difficult to develop, but 
the limiting case in which the resource stock 
adjusts instantaneously to its steady-state 
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value is instructive. In this case the economy 
would always operate on the dS/dt = 0 lo- 
cus. As the founding population is (by as- 
sumption) smaller than its steady-state 
value, the economy would move down the 
dS/dt = 0 locus until it reached the steady.. 
state. In this case, when the predator adjusts 
slowly and the ‘prey adjusts infinitely 
quickly, both the resource stock and the pop- 
ulation level adjust monotonically toward 
their steady-state values. 

Technically, we can use the condition 
dS/dt = 0 to solve for S as a function of L 
[from (9)] then substitute this in (11) to get 
a one-variable differential equation in L. The 
solution is logistic: population grows fast at 
first then levels off, as in the standard descrip- 
tion of Polynesian islands. 


IV. Applying the Ricardo-Malthus Model to 
$ Easter Island 


In this section we use simulations of the 
Ricardo-Malthus model to make two points. 
First, by choosing parameters that are consis- 
tent with our knowledge of Easter Island, 
Polynesian civilizations generally, and other 
Neolithic populations, we are able to generate 
a time series for population size and resource 
stocks that appears to (approximately ) repli- 
cate Easter Island’s past. We take this as ten- 
tative support for our theory of the rise and fall 
of the Easter Island civilization. Second, pa- 
rameter changes can change the time pattern 
of population and resource stock evolution 
from extreme cyclical overshooting to mono- 
tonic adjustment toward the steady state.'? We 
take this feature of the model as a possible 
explanation as to why Easter Island appears to 
be different from other Polynesian islands. 


A. Parameter Choice 


Some parameter values are simply a matter 
-of scaling; such as the carrying capacity of the 


12 The key parameter difference that we consider in ex- 
plaining the difference between Easter Island and other 
Polynesian islands is substantial. The model also implies 
bifurcations (in parameter space) around which major 
changes in predictions arise. =. 
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forest/soil resource complex. It is conye 
for the stock to be similar in magnitude iy 
population, so we let the carrying Capacity, 
the resource stock be 12,000 units. This jg a 
starting value of the stock when Polyneg, 
colonization first occurred. (The forest hy 
been in place for approximately 37,000 y 
before first colonization, so carrying capaci 
had certainly been reached.) 
The next parameter to consider is q, lal 
harvesting productivity. The productivity of 
unit of labor is aS. One unit of labor com 
sponds to the amount of labor one person ty 
provide in one period. It is convenient to ky 
ten-year intervals be periods. If we let q 
0.00001, this means that if S = K,a househli 
could provide its subsistence consumptig! 
(the amount just necessary to reproduce its 
in about 20 percent of its available labor tj 
Accordingly, there is considerable surplus y 
the island when the resource stock is la 
This seems roughly consistent with know§ : 
information. 


vesting the resource. The other sector incluk 
manufacturing and service activities. Vain. 
pieces of evidence suggest that the resoung < 
sector probably absorbed somewhat less the: 
half the available labor supply. A value of 
for is probably in the reasonable range. ; 
Another important parameter is r, the 
trinsic growth (or regeneration) rate of thet} 
source. We initially assume an intrinsic gro, 
rate of 0.04, implying that, left to itself, 
forest/soil complex would increase by 4 jt 
cent per decade in the absence of conge 
effects (i.e., if the stock’ were small comp 
to the carrying capacity). The remaining} 
parameters are the demographic parame 
Let (b — d) = —0.1 and let ġ = 4. The vi 
for (b — d) means that the population Ww} 
decrease by.10 percent per decade in 
sence of the resource stock. Letting ¢ =4 
plies that there would be positive pop 
growth if the stock were approximately 50 
cent of its carrying capacity, and negative! 
ulation growth otherwise. Througholl! 
simulation period annual population 8% 
never exceeds 1 percent per year, whit 
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RE 3. EASTER ISLAND BASE CASE 


sis ont with the demographic literature on 
thic populations. The range of estimates 
the founding population ranges from 20 to 
Dd of more, with 40 being commonly used 
‘ss ‘a'plausible estimate. We take the starting 
fyclue to be 40, but varying this estimate makes 
faite’ difference to the results. These parame- 
tera yield the time-series pattern shown in 
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ste’” for the 
ay of trying 
1 that 6 ise 
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K Figure 3 shows an interesting dynamic pat- 
Oem, For the first 300 years, humans have little 
dE inpact on the resource. Population then begins 
Fist increase rapidly and the resource stock falls 
pceipitously for the next 800 years. About 
4000 years after discovery the initial population 
v4 40 has grown to about 10,000. The period 
mable range EES High population (and high labor supply) in 
ster is r, tÆ simulation corresponds to the period of in- 
n) rate of WSE ase carving in the archaeological record. The 
1 intrinsic gogpsimulated resource stock reaches its trough 
left to itself mabout 250 years later, close to 1500 A.D., as 
icrease by 4} per capita resource consumption. Recall 
i few weapon (the ‘‘mataa’’) appears in 


ce of conga ; 
> small compa chaeological record at about this time, 





















1e remaining. gence of cannibalism also appears, and 
phic parang 48 movement to fortified structures and 
p = 4. They as dwellings. The simulated resource 
,opulation W9 begins its recovery but population con- 
iecade in thé to fall, implying a 1722 population of 


Uxlmately 3,800 to meet the Dutch ships, 
t from the 3,000 actually estimated. The 
ated 1774 population is about 3,400, 
ewhat more than the 2,000 Cook es- 
In the 1800’s there is substantial out- 


tervention so our model would no 
hold. 


„etting ġ zi 
sitive popu% 
oximately 50% 
ind negatives 
Througho; 
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B. Why Is Easter Island Unusual? 


Our model allows both cyclical and*mono- 
tonic behavior so it offers the potential to ex- 
plain both the cyclical overshooting that . 
occurred on Easter Island and the monotonic 
behavior apparently observed on the major 
Polynesian islands. There is no réason to be- 
lieve that Easter Island was an outlier in its 
underlying demographics, its tastes, or its 
technology. However, it was an outlier in one 
very important respect. The palm tree that 
grew on Easter Island happened to be a very 
slow-growing palm. This palm (which was the 
single most significant component of the for- 
est/soil complex on Easter Island) is now 
known (due to J. Dransfield et al., 1984) to 
have been a species of Jubea Chilensis (the 
Chilean Wine palm). This palm tree grows no- 
where else in Polynesia, and it is perhaps the 
only palm that can live in Easter Island’s rel- 
atively cool climate. An authoritative text 
(Alexander M. Blombery and Tony Rodd, 
1982 p. 110) reports that ‘‘Cultivation pre- 
sents few problems in a suitable temperate cli- 
mate, but growth of these massive palms is 
slow and it is generally later generations who 
get the benefit from their planting.’’ Under 
ideal conditions, the Jubea palm requires about 
40 to 60 years before it reaches the fruit- 
growing stage, and can take longer.'* 

In contrast, the two most common large 
palms in Polynesia are the Cocos (coconut 
palm) and the Pritchardia (Fiji fan palm). Nei- 
ther of these palms can grow on Easter Island, 
and both are fast-growing trees that reach 
fruit-growing age in approximately seven to 
ten years. For a resource based on these palms, 
it would be more reasonable that the intrinsic 
growth rate would be about 0.35 or 35 percent 
per decade.'* Figure 4 shows a simulation that 


'3 This information is based on private communication 
with palm growers. Easter Island was also an outlier in 
rainfall and temperature, contributing to. slow, growth of 
the resource. 

14 Translating ‘‘time-to-fruit”’ into intrinsic growth rate 
ris difficult, as trees continue to grow well after first yield- 
ing fruit and we are interested in the entire forest/soil 
complex in any case. Associating a 40-year time-to-fruit 
with r = 0.04 and a seven—ten-year time-to-fruit with r = 
0.35 is plausible but very rough. Also, the trees were not 
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FIGURE 4. POPULATION AND RESOURCE DYNAMICS WITH 
Fast REGENERATION 


is identical to Figure 3 except that the growth 
rate is raised from 0.04 to 0.35. 

The higher intrinsic resource growth rate 
causes the population to adjust more smoothly. 
In fact, this simulation is technically cyclical, 
but the cycle is so muted that the adjustment 
path is virtually monotonic, as the population 
peaks at 42,245 before leveling out at its 
steady-state value of 41,927. The population 
trajectory would not become strictly mono- 
tonic unless the intrinsic growth rate exceeded 
0.71, but even at moderate growth rates of 0.15 
or 0.2, the population ‘‘crash’’ would be too 
small to be evident to archeologists. Low 
growth rates, on the other hand, produce dra- 

matic cyclical fluctuations. 

Thus an island with a slow-growing resource 
base will exhibit overshooting and collapse. An 
otherwise identical island with a more rapidly 
growing resource will exhibit a near-monotonic 
adjustment of population and resource stocks 
toward steady-state values. Even if everything 
else were similar across islands, this one fact 
would allow the Ricardo-Malthus model to be 
consistent with both the spectacular overshoot- 
ing and collapse on Easter Island and the far 
less dramatic development exhibited on other 
major Polynesian islands. _. 

The model is also consistent with the 12s S0- 
called ‘‘mystery islands’’ that were once set- 


the only element in the forest/soil complex, which further 
complicates the problem of estimating the intrinsic growth 
rate for different hypothetical situations. 

































tled by Polynesians but were unoccupied ` 
European contact. All but one of these islat 
have relatively small carrying capacities, Ae tion 
plying our model, we observe that if K js «Bice, 
ficiently small then condition (13) will noth 
satisfied and there will be ng ‘‘interior’’ steal 
state. A colonizing population could atin, 
but would eventually drive the resource Stoj 
down to a level that would cause extinction 
the human population. 4 wh: 
Another noteworthy Polynesian settleng nnar 
is New Zealand’s South Island. The South} #se4yondu 
land had a high concentration of large figi tgs 
less birds (up to 10 feet tall and 500 pou% 
in weight) called ‘‘Moas.’’ First Polynesig 
settlement is thought to have occurred aroujé 
1000 a.p. (although it may have been later} 
Following settlement the South Island Mant 
(or ‘‘Moa-hunters’’ ) lived ‘‘high on the birt'§ 
by hunting Moa, along with fishing and agi: 
culture. Over this period there was substantgg 
deforestation and the Moa were driven toe 
tinction. It is not clear exactly when the M 
became extinct, but the larger species dis 
peared first, possibly lasting as little as 2X - 
years following settlement. There is some di 
agreement over whether population overh 
then declined, or whether it merely stagnat 
as the Moa disappeared.'° However, the Sout 
Island was more densely settled than the Not 
during the Moa-hunting period, but at Ew 
pean contact (about 1700) settlement WE 
denser in the more temperate and warm 
North. Thus it is possible that the South Isi# 
exhibited population. overshooting, as wo% 
be consistent with the slow-growing resog 
base (consisting of Moa and slow-gro nar 
forests). ue evide 
While Easter Island and Polynesia m 0 
generally offer interesting applications 
model, the significance of our analysis W 
be greatly expanded if the basic approach st 
also relevant for other cases. In the follow 


zi 


ulation overshoot did occur, stating that ‘‘the pi 
one of ultimate population decline, with a gradual 04 
in cultural energy owing to an insurmountable de 
resources.” A good reference on the Moa is / 
Anderson (1989). 











































snous resource degradation have played 
portant role in the decline of other 


+, 
wa: 


V. Other Applications 


Our model might be consistent with the Ma- 
BF on collapse which, like Easter Island, was long 
ny aa Jed as a mystery. The Mayan empire oc- 
í Eo. “ied what is now the Yucatan Peninsula of 
ian settler: ico; and parts of Guatemala, El Salvador, 
W <4 Honduras. The empire reached its peak in 
neriod 600—830 A.D., then suffered a rapid 
ine in both population and cultural sophis- 
tion over the following 100 years. The civ- 
ration was partially rebuilt in an outlying area 
at this later Mayan civilization went into de- 
zijne about 1200 A.D. Only fragments of the 
bE Mayan people and civilization still existed at 
ANE vhs time of Spanish conquest in 1521. 


OMe By the early ninth century, agricultural output 
Se could no longer support the dense population 
fi the region, leading to out-migration, a sharp 
géecline in population, and a collapse of the 
soplilsticated civilization that had developed. 
main unresolved question concerns the 
to which drying and erosion were the 
sogenous result of human activity rather 
ait arising from exogenous climate changes. 
€vidence suggests that both factors were 
tuportant. The following quotes from Culbert 

88 pp. 99-101) provide a now widely ac- 
-Septed interpretation. 


All available data show that populations 

he Southern lowlands rose rapidly to 

Late Classic peak. Not only was the 

{Spilation unusually dense (200 per sq. _ 

ans ut it covered an area too large to 
Ow adjustment through relocation or 


ante: 7 

hat ae pict tion. „Maya agriculture became 
1a gradual dogg ingly intensive as population 
untable dede Overextension— overshoot in 
he Moa is'Ag s terminology —often afflicts 


lex systems during a period of ex- 
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pansion... . In the Maya case, the over- 
extension was ecological and consisted 
of a population system dependent upon 
maximal results from a subsistence sys- 
tem that made no allowance for long- 
term hazards. 


Culbert’s description of Maya seems similar 
to the operation of our model. Maya achieved 
high population density through intensive ag- 
riculture that, at best, left no margin of safety 
when climatic conditions changed marginally 
for the worse and, at worst, created an endog- 
enously determined agricultural shortfall aris- 
ing from deforestation and soil erosion. 

Resource degradation also played an impor- 
tant role in the decline of the ancient Mesopo- 
tamian states (in what is now Iraq). Various 
civilizations and empires rose and fell in parts 
of this region between the period 2350 B.c. and 
600 A.D. The first true empire was the Akka- 
dian Empire (2350—2150 B.c.). Soil samples 
from the Akkadian region indicate increasingly 
intensive agricultural land use until about 2200 
B.C., when the soil in the northern part of the 
empire became too dry to support the popula- 
tion (H. Weiss et al., 1993). The entire northern 
portion of the empire was abandoned, causing 
a major migration into the South, which in turn 
strained food and water supplies in the South 
to the point of civic collapse and breakdown of 
central authority (Ann Gibbons, 1993). By 
2150 B.c. the empire had degenerated into a 

. group of independent city states. 

Later civilizations made extensive use of ir- 
rigation. Joseph Tainter (1988) writes ‘‘In this 
area, agricultural intensification and excessive 
irrigation lead to short-term above-normal har- 
vests, with increasing prosperity ... [but] the 
rise of saline ground water erodes or destroys - 
agricultural productivity.” By the end of the 
third dynasty of Ur (about 2000 B.c.), agricul- 
tural yields per unit of land had fallen by about 
50 percent since the first dynasty (about 300 
years earlier), and about twice as much seed 

-per unit of land was required even to achieve 
this lower yield. (See Robert McCormick 
Adams, 1981 p. 151.) Declining agricultural 
productivity was an important contributing fac- 
tor to the decline of Ur. 

Other major Mesopotamian civilizations, 
including the Assyrians, the Babylonians, and 
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the Sumerians suffered the same salinization 
problem suffered by the Ur. Gradually, 
through successive empires based in different 
parts of the region, nearly the entire area be- 
came infertile. According to Tainter (1988), 
by about 1200 A.D. the total occupied area in 
the region had fallen to pertiaps 5°6r 6 percent 
of its earlier peak, and most previously fertile 
land was uninhabitable. Furthermore, while 
climate has fluctuated in the region over the 
past few thousand years, there is no discern- 
able trend in precipitation (as described by 
Adams, 1981 pp. 12-13). Therefore, while 
the relative contribution of natural climate 
change and human activity to the destruction 
of a major agricultural region cannot be pre- 
cisely determined, human agricultural prac- 
tices seem to have been the dominant factor. 
A less well-known example concerns the 
Chaco Anasazi in the southwestern United 
States. Between about 1000 A.D. and 1150 
A.D. the Chaco Anasazi built an impressive 
system of roads, settlements, and ‘‘great 
houses.’’ As described in Stephen H. Lekson 
and Catherine M. Cameron (1995), the largest 
great houses had about 700 rooms and are 
thought to have been administrative centers for 
a complex trading system. In this period, pop- 
ulation grew rapidly, probably through immi- 
gration as well as natural increase. This region 
is prone to significant rainfall variations, and 
a dry period began in 1134, lasting until 1181. 
After 1134 no new great houses were built, 
much of the land was abandoned, and the elite 
culture of the Chaco Anasazi disappeared. The 
puzzle here is that this drought was no more 
severe than droughts of the previous century 
that had been weathered without strain. 
Tainter (1988) suggests that Chacoan Anasazi 
economic organization had already been 
pushed beyond its limits by population growth 
and that the moderate drought of 1134 pushed 
an already overloaded system into collapse. 
The Maya, the Anasazi, and the ancient Me- 
sopotamian civilizations all show a similar 
‘pattern. In each case, decline of the resource 
base, particularly soil degradation, was the 
main factor precipitating a population crash 
and the decline of a complex civilization. 
While exogenous climate fluctuations may 
have played a significant role in these cases, 
population growth and endogenous resource 



































degradation were also important, making 
similar to Easter Island. $ 

Overall, evidence on soil change and 
information recently obtained from cores: 
ples has significantly changed the wey Mo à 


in major civilizations. The role of warfare x 
violent conflict is also being ecli a 
Rather than being the cause of. decline, Viol 
conflict is commonly the result of resoy R$ 
degradation and occurs after the civilizaj 

has started to decline, as on Easter Island. pe 


VI. Institutional Adaptation 


A critic might object that our analysis a! 
derestimates institutional adaptation. We ¢! 
sume an open-access resource, but perhaps 
should expect more efficient resource mana 
ment institutions to evolve. This is prima 
an empirical question.!” Elinor Ostrom (19 8 
has studied the historical record on commg:: 
property problems and argues persuasivg 
that efficient institutional reforms sometim 
occur in primitive (and advanced) socie 
but sometimes do not. (See also Ostrom ets 
1994.) In Ostrom (1990 p. 21) she 
‘*some individuals have broken out of the t$% 
inherent in the commons dilemma, wherig-?: 
others continue remorsefully trapped into dy 
stroying their own resources.’’ She also 0 
serves (1990 p. 210) that ‘‘we cannot [ado 

. a presumption that appropriators will aoq§ 3 
new rules whenever the net benefits of ais 
change will exceed net costs.’ 

The main objective of Ostrom (1980) 
to determine the factors that favor effi 
institutions and those that impede efficient 


'€ Itis also possible that exogenous climate chang% 
have affected Easter Island, as emphasized to us in d 
communication by Grant McCall. See also J. ia 
(1991). ; 

.’ Various theoretical arguments can also be sid gS 
to explain why a society may not undertake an efi 
enhancing policy change. Raquel Fernandez and EC 
Rodrik (1991) model a status quo bias against S ee < 
forms that arises when individual gainers and lose ae ‘ 
reform cannot be readily identified ex ante. 5 
Alberto Alesina and Allan Drazen (1991) provi as 
in which reforms are delayed if different groups 
tempt to shift the burden of adjustment to other go, 
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int, Maki a al response. The most important fa- for seedlings would almost never have been of 


is factor is an agreed-upon and correct direct benefit to the cultivators. 
change ang: ~ anding Of the. problem. If a soil ex- The Easter Islanders did make some institu- 
| from core’ on problem is falsely attributed to low tional changes in response to resource scarcity. 
the way may then the response might be more rain One major change was that at some point the 
past. Reg "=< rather than restructured property rights. Easter Islanders abandoned the system of statue ~ 
d to be com ase a modern example, it was not possible worship and apparently pushed over almost all 
e of Warfagg settle on an effective response to ozone de- of the statues (usually facedown). Institutional 
g Teinterprg ón until there was substantial agreement changes of this type, amounting to religious 
‘decline, vA «tq problem existed and on the mechanism revolution, are clearly very costly and, while 
sult of regi ng the problem. Even then, obtaining they are understandable responses to declining 
the civilis ensus was difficult, and several major circumstances, they are unlikely to have helped 


aster Island 
ig 


 antries have refused to participate in the re- the underlying resource-use problem. 
rT $ 
tation (am 


international agreement. The other kind of institutional reform sug- 
Iris also helpful if proposed rule changes af- gested by our model is population control. There 
gct relevant parties in a similar way rather than is a large and fascinating literature on social in- 


4 


ur analysis generating winners and losers. Other favorable stitutions affecting fertility. Perhaps the main 
tation. We SE rctors include low discount rates and low en- point is simply that the range of such institutions 
but perhaps $ &xcement costs. It is also helpful if the affected is very large. Some societies adopted practices 
source ma ‘group is small and if the group has a high level that directly limited population growth, includ- 
ris is primi sf initial trust and sense of community. Thus, ing infanticide and genital mutilation. More sub- 
Ostrom (199MM és example, if there are existing ethnic or social tle and benign approaches involving marriage 
rd on comi divisions that dominate the way people perceive customs and crude contraceptive methods were 
S persuasi T Misses, this makes appropriate institutional also used. Such population controls tend to be 
mms someta change difficult to achieve. These conditions fol- density dependent. For example, as crowding in- 
ced) societammm law from the general principle that institutional creases and resources become more scarce, it is 
> Ostrom etm ebange is more likely to occur when the individ- more difficult for young men to acquire suffi- 
'1) she wagmmsals who must make the change are confident cient wealth to marry, and both infanticide and 


out of the NEE thst they will be among the beneficiaries. contraception would be practiced more fre- 
mma, whea -Easter Island did not present afavorableen- quently. This was the pattern elsewhere in Poly- 
tfenment for efficient institutional change. It nesia and was probably true of Easter Island as 


Pa 


She also unlikely that the Islanders understood the well. This is consistent with our model, as it 
sannot [aie 6logy of the forest-soil complex or the likely means that net fertility falls when per capita re- 


tors will a tcentive effects of alternative institutional ar- source consumption falls. 


eG 
efits o g feigements. It is even possible that individual However, in societies that lack a clear sci- 
ig landers did not recognize that depletion was entific understanding of their world, institu- 
om ( 1990} <ing place. Although the forest disappeared tional adaptation involving fertility, property 
avor efficky ly by archaeological standards, change rights, and other matters is likely to be a ‘‘trial 
le efficient ee a low over the course of an individual life and error’’ process. Efficient institutions 


Typical life spans for those who survived would probably be achieved only after a long 
cy would have been on the order of 30 period of time and many trials. It would be 


mate change » and even during most rapid depletion, difficult for a society like Easter Island to 


zd to us in iy forest ‘stock would have declined by no adapt efficiently in á single boom and bust 
oJ. Flenley ! + than 5 percent over a typical lifetime. cycle. aia 
also be aie " ii ar problem had been recognized, the sx 


~< years taken for a tree to mature would 
then working life of virtually all island- 
lus a program of replanting and caring 


ke an efi 
andez andi 
against sue 


; and lose. 

ante. Sit? We have used the life tables in Kirch (1984 pp. 112-14) 
provide af 1 , to calculate that life expectancy at age 5 is slightly less 
t groups S Ongevity estimates are derived from the four than 30 years, and that the percentage surviving until their 


> other gaan Series available for prehistoric Polynesia. mid-forties would be approximately 5 percent. 
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The other declining civilizations also seem 
to be poor candidates for efficient institutional 
reform. We do not have space to discuss each 
case in detail, but anticipation and understand- 
ing of the ecological problem would have been 
limited, and conflict between competing 
groups would have delayed reform. Even in 
sophisticated societies like Maya and Meso- 
potamia, where property rights may have been 
relatively secure, the complex array of exter- 
nalities involved in soil erosion, salinity and 
declining water tables would have been hard 
to address successfully. 

We recognize, however, that modern knowl- 
edge of institutions in preliterate civilizations is 
very limited. In considering Polynesia, we know 
there was substantial variation across the differ- 
ent islands. Therefore, one alternative hypothesis 
to ours is that institutional variation across so- 
cieties within Polynesia might be the explanation 
for contrasting growth experiences. 


Vi. Concluding Remarks 


This paper presents a simple model of re- 
newable resource growth and population dy- 
namics and employs this model to provide a 
plausible account of the rise and fall of the 
Easter Island civilization. For reasonable pa- 
rameter values, the model generates a boom 
and bust cycle in which population grows, the 
resource base is degraded, and population ul- 
timately falls. This cycle arises because the re- 
source base has a slow regeneration rate. A 
faster-growing resource would allow mono- 
tonic convergence toward the steady-state 
population. Thus the model can explain the 
difference between Easter Island and other 
Polynesian islands based on known differ- 
ences in resource growth rates. 

Our analysis has several lessons for the 
modern world. First, the model implies that 
changes in technology, the environment, or 
human behavior can create feast and famine 
cycles that may be a recipe for violent conflict 
over apparently diminishing resources. Easter 
Island “iiay“be only one casé of ‘many where 
unregulated resource use and Malthusian 
forces led to depletion of the resource base and 
social conflict. Identifying countries at risk in 
the modern world may be difficult, but our 
model provides some guidance as it identifies 


cases where resource degradation civas 


































the key parameters that make cyclical i 
turns more likely. : 
A modern case that might be consisten 
our model is Rwanda, which entered the 
during 1994 because of. a violent civi] 
This war was normally attributed to ethnics 
sions between Hutus and Tutsis, but py 
careful analysis ‘suggests the possibility 
Malthusian population growth, resource g.# 
radation, and resulting competition for gt ir 
sources was at the root of the conflict. Betwg ; 
1950 and 1994, population in Rwanda qu 
rupled. The boom began in the 1950’s wh 
advances in health care and agricultural pý 
tice led to increasing real incomes and rig 
net fertility. By the 1980’s what had been i 
open frontier was ‘‘filled up,” and real liy lve 3 
standards started to fall. Conflict over landt ET 
tween Hutus and Tutsis became increasig 
severe, culminating in a civil war in whig 
significant fraction of the population wH 
killed and a very large fraction (perhaps $$ 
percent) became refugees. Thomas Home 
Dixon (1994) describes several other mod 


and demographic dynamics, might be hel 
in understanding such situations. 
A second lesson of our analysis is thti: $3% 
provides one of the first formal empirical BS ews 
amples in which the cycles that arise in nt 
linear models appear relevant in analyzif 
long-run economic development. In shor, 
suggests that nonlinear dynamics are likely 
be relevant in studying economic growth, # x 
pecially in situations where renewable & a 
sources are important. 
Third, our analysis of Easter Island anid 
other cases suggests that economic decs ER 
based on natural resource degradation 18 val 
uncommon. Institutional change could polé 
tially have averted collapse in many of 
societies but it was not undertaken (or at i 
was not undertaken fast enough). Institu 
failure in renewable resource use does-haj 
and it has been fatal for several societies. <% 
cent events in the world’s major fisheries "4 a 
gest that institutional change remains 
An extreme case is the (Canadian) New! 
land cod fishery which was closed in 1 
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iad stocks were down to less than 5 percent 

Or peir 1960 levels. Our work offers support 

‘the position that it is both important and 

difficult to reach efficient institutional arrange- 
jn renewable resource use. 


lications, we do not wish to embrace the 
E essjmism Of modern neo-Malthusians. 
First, in considering the modern world, one 
tition s¢°4Me onld introduce nonlinearity in the response 
flict. Be fertility to consumption so as to allow for 
wanda @ demographic transition of the type ob- 
1950's: „ed in modern high-income societies. 
cultural ifically, net fertility declines with in- 
es and pal come at sufficiently high-income levels. In 
had bee r model, cyclical dynamics arise only 
d real REE when fertility has a strong enough positive 
over landas: response to per capita consumption, so in- 
i i eorporating a demographic transition would 
probably allow the possibility of escape 
pulation at iom cyclical dynamics if a high enough in- 
(perhaps M come could be reached. 
bmas Hom: In addition, our model abstracts from tech- 
other moda Salogical progress, which is the main force 
on driven ame émphasized by growth optimists. Abstracting 
iolent coat í ‘fram technological progress is reasonable for 
dards. Mogae cur discussion of Easter Island, and perhaps 
plicit resa £6 Most of the other examples we have dis- 
ght be hèl x edased, but it would clearly be a serious omis- 
7 ie £00.in considering the modern world. The 
ysis is th r el can, however, be readily augmented in 
A ui direction. Both the r parameter (resource 
growth) and the æ parameter (harvesting ef- 
ciency) could be viewed as susceptible to 
ef exogenous or endogenous technical 
4 Progress. Furthermore, progress in the form 
further scientific understanding may also 
late institutional adaption in resource 
‘This would require a larger modification 
the model as it implies a different charac- 
tion of temporary equilibrium in the 
a ce Sector. On the other hand, the pos- 
ee Sbility of demographic transition notwith- 
HE, net fertility (particularly declining 
ty) is affected by technical progress, 
-aaeases in net fertility driven by im- 
its in medical technology may well 
Population overshooting. 
» While technical progress has been the 
force in the growth process since the 
of the industrial revolution, this last 
a small fraction of the time that 


ains diih 
Be 

1) Ne WIE 

d in 19 a 


nally, despite our model’s rather gloomy ` 
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humans have harvested from the earth and built 
complex, but ultimately fragile, societies. It is, 
for that matter, shorter than what might be re- 
garded as the ‘‘golden age’’ of Easter Island. 


7? APPENDIX A 


PROOF OF PROPOSITION 4: : 
Let (L*, S*) represent a steady state. Define 
the vector u = (u,, us) = (L — L*, S — S*). 
Thus u is the vector of deviations in L and S 
from a particular steady state. It follows that 
du,/dt = dL/dt and dus/dt = dS/dt, where 
aL/at and dS/dt are given by (11) and (9). 
Using a Taylor series expansion for du/dt 
around u = 0 [i.e., around (L*, $*)], it can 
be shown (as in William E. Boyce and Richard 
C. DiPrima, 1992 pp. 450-51) that du/dt can 
be expressed as follows. 
(Al) duldt = J(L*, S*)u + R(L,S) 
where J is the Jacobian matrix of first-order 
partial derivatives of dL/dt and dS/dt with re- 
spect to Z and S, and R(L, S) is a remainder 
of higher-order terms that can be ignored near 
u = 0. J is evaluated at (L*, S*). Denoting 
the components of J as Ji, Ji2, etc., in the 
obvious way, we can write this linear system 
as 


Ja Sin | | uz 

A2 du/dt = : 
( ) H a 
A two-equation system of linear differential 
equations [as in (A2)] has a general solution 
of the form 
(A3) u(t) = c, E,e2!" + eine. 
where cı and c» are constants, zl and z2 are 
the eigenvalues of coefficient matrix J, and E1 
and E2 are the corresponding eigenvectors. 
The dynamic behavior of the system depends 
on whether zI and z2 are real, “complex, or 
imaginary, and on whether any real part of 
these eigenvalues is positive or negative. The 
system is explosive if z1 and z2 are positive 
real numbers, and converges monotonically to 
the steady state if z] and z2 are negative. If z1 
and z2 are complex numbers then cyclical be- 
havior emerges. The coefficients of J can be 
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determined by taking partial derivatives of (9) of characteristic equation (A6), the Prog) PON 
complete. fs 


and (11). 
(A4) Jı =(b-— d) + baBS; Jn = ba BL; 
Ja = —OBS; Ja = r — 2rSIK — aBL. 


(i) For steady state 1 (L= 0, S = 0); coef- 
ficient matrix J becomes 


b-d | 


(A5) J(0, 0) = | o 7 


The eigenvalues are the diagonal elements 
b — d < 0and r > 0. This combination of one 
negative real eigenvalue and one positive real 
eigenvalue implies that steady state (0, 0) is 
an unstable saddlepoint. 


(ii) and (iii) For steady state 2 (L = 0, § = 
K) and steady state 3 (L = (r/aZ)(1 — S/K), 
S = (d — b)/dafB)) we proceed in the same 
way, making the appropriate substitutions in 
matrix J using (A4) and calculating the as- 
sociated eigenvalues of J. For steady state 2 
the eigenvalues are —r and (b — d) + dafK 
{which is positive by (13)]. As with steady 
state 1, the combination of positive and neg- 
ative real eigenvalues implies that steady state 
2 is an unstable saddlepoint. For steady state 
3, matrix J has nonzero off-diagonal elements, 
so the eigenvalues cannot be seen by inspec- 
tion but must be obtained as the roots of the 
characteristic equation. Letting S* denote the 
steady-state value of S, the characteristic equa- 
tion is z(rS*/K + z) — (d — b)r(1 — S*/ 
K) = 0, which is quadratic in z and has roots 


(A6) z=[—-rS*/K+ ((rS*/K)* 


—4(d—b)r(1 —S*/K))!?]/2. 
If the discriminant of (A6) is positive (or 


zero), then both solutions for z must be neg- 
ative real numbers and the steady state is a 


stable node with monotonic convergence. A 


negative discriminant implies complex eigen- 
values with a negative real part, and the steady 
state is a stable spiral point. The system then 
exhibits damped cycles that converge on the 
steady state. Noting that equation (14) given 
in the proposition is simply the discriminant 
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PROOF OF PROPOSITION 5: 


(i) 


(ii) 


(iii) 


The paper uses the logistic growth 
tion, but the analysis is readily gener 




























fa 


Proposition 4 establishes that steady. spt 
1 (L = 0, S = 0) is a saddlepoint all ye 

ing a local approach along the horizta 
axis, and we can tell from inspection 
the full nonlinear system that stead 
1 is reached from any point along 
horizontal axis. If S = 0, and L > 0, & 
L must fall to zero. Thus one trajec 
of our full nonlinear system is given} 
the horizontal axis in Figure 2. = 
We know from Proposition 4 that stat R87 
state 2 (L = 0 and S = K) is a saddlepk§ 
allowing a local approach along the mf 
tical axis, and we can tell from inspect 
that steady state 2 is reached frome Reset 


Peg woke 


ward level K, hence another trajectory 
our full nonlinear system is given by 
vertical axis in Figure 2. 

As the differential equation system is 
tonomous and continuously differents 
ble, no two trajectories of the system% 
intersect. Any trajectory that starts freg 
a point strictly interior in Figure 2 map 
remain strictly interior. ( Otherwisti§ 
would touch one of the axes, which 
know is impossible, as the axes theg 
selves are trajectories.) We can thenty 
vide equation (9) by (11) to obtain gy 
slope of any system trajectory as OY 
dL = [rS(1 — SIK) — aBLS\{LO§ 
d + @a$S)]. Inspection of this sloe* 
each region of the phase diagram in Mg 
ure 2 implies that the direction of Bg 
trajectory must eventually be inwardi Lan ree 
wards the interior steady state. Limil ayy 
cles can be ruled out by apply my 
theorem due to Kolmogorov as pro"! 
by Robert M. May (1973 pp. 85- 
Therefore, the trajectory must app 
‘the steady state. 
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a ‘eneral compensatory (bent over) 

8. sanction provided that G(0) = G(K) = 
poi that G(S)/S is strictly decreasing in S. 
Danis-case we let r = lims- G(S)/S. Equa 
sag (9) becomes daS/dt = G(S) — aZLs. 
ye tion (11) is unchanged. Proposition 1 fol- 


mmediately, except that in steady state 


a 
Steady (ame squat 
É lows i 


point align 
e hori F- K 
Be. P ) as before. Proposition 2 follows im- 

gediately. To obtain an analog of Proposition 

‘we need to replace the experiment of in- 
-easing r by an overall increase in G(S)/S 
znd, instead of focussing on whether S < K/2 
§ > K/2, we focus on whether G(S) is in- 
ceasing or decreasing. Figures 1 and 2 have 
g same general form as before. In Proposi- 
a4, parts (i) and (ii) are unchanged, and 
(ii) follows as before except that condition 
E (14) becomes H(S*)* — 4(d — b)G(S*)/S* 
$ <0, where H(S*) = G’(S*) — G(S*)/S*. 
Proposition 5 is unchanged. To carry out the 
er trajectory iB Easter Island simulation we, of course, require 
is given byak same specific functional form. The logistic 
Mee form works well, but it is clear from this brief 
diseussion that other forms would also work. 
6fe general analysis of general functional 
Structures is difficult, although methods along 
‘th-lines of those used in Peter Howitt and R. 
ary McAfee (1988) could perhaps be 


E = G(S*)/(S*eß), where S* = (d — b)/ 


must raise| 


n systemi 
the systems 
hat starts in 


REFERENCES 


We can hage: k= 
1) to obtat Adami, Robert McCormick. Heartland of cities: 
ajectory a Surveys of ancient settlement and land use 


oR the central floodplain of the Euphrates. 
Tago; University of Chicago Press, 


. 


73 pp. 855 ey m0a-hunting in prehistoric New Zealand. 
snidge: Cambridge University Press, 


Paul and Flenley, John. Easter Island earth 
London: Thames and Hudson, 1992. 
l S The Polynesians: Prehistory of 


: r d : 
wth People. London: Thames and 
ic growls 1987 


dily genef 


¥ steady-state value of L is expressed as . 


Atholl. Prodigious birds: Moas and-* 





BRANDER AND TAYLOR: SIMPLE ECONOMICS OF EASTER ISLAND 137 


Blombery, Alexander M. and Rodd, Tony. Paims: 
An informative practical guide to palms of 
the world, their cultivation, care and land- 
scape use. London: Angus and Robertson, 
1982. ltl at ke 

Boyce, William E. and DiPrima, Richard C. Ele- 
mentary differential equations and bound- 
ary value problems, 5th Ed. New York: . 
Wiley, 1992. 

Boyer, George R. ‘‘Malthus Was Right after - 
All: Poor Relief and Birth Rates in South- 
eastern England.” Journal of Political 
Economy, February 1989, 97(1), pp. 93- 
114. E 

Brander; James A. and Taylor, M. Scott. ‘‘Inter- 
national Trade and Open Access Renewable 
Resources: the Small Open Economy 
Case.’’ Canadian Journal of Economics, 
August 1977, 30(3), pp. 526-52. 

Brown, John Macmillan. The riddle of the Pa- 
cific. London: Fisher Unwin, 1924. 

Brown, Lester R. ‘‘Nature’s Limits,’’ in Lester 
R. Brown, ed., State of the world 1995. New 
York: Norton, 1995, pp. 3~20. 

Clark, Colin W. Mathematical bioeconomics: 
The optimal management of renewable re- 
sources, 2nd. Ed. New York: Wiley, 1990. 

Culbert, T. Patrick. The collapse of ancient 
states and civilizations. Tucson, AZ: Uni- . 
versity of Arizona Press, 1988. 

Daniken, Erich von. Chariots of the gods? Un- 
solved mysteries of the past. New York: 
Putnam, 1970. 

Dark, Kenneth R. Theoretical archaeology. Ith- 
aca, NY: Cornell University Press, 1995. 
Dransfield, J.; Flenley, J. R.; King, S. M.; 
Harkness, D. D. and Rapu, S. “‘A Recently 
Extinct Palm from Easter Island.” Nature, 
December 1984, 312 (4986), pp. 750-52. 

Fernandez, Raquel and Rodrik, Dani. ‘‘Resis- 
tance to Reform: Status Quo Bias in the 
Presence of Individual-Specific Uncer- 
tainty.” American Economic Review, De- 
cember 1991, 81(5), pp. 1146-55. 


Flenley, J.; King, S.; Teller, J.; Prentice, M.; 7 


Jackson, J. and Chew, C. ‘“The Late Quater- 
nary Vegetational and Climatic History of 
Easter Island.” Journal of Quaternary Sci- 
ence, 1991, 6(1), pp. 8-115. 

Gibbons, Ann. ‘‘How the Akkadian Empire 
Was Hung Out to Dry.” Science, August 
1993, 261 (5122), p. 985. 


138 THE AMERICAN ECONOMIC REVIEW 


Gordon, H. Scott. ‘“The Economic Theory of a 
Common-Property Resource: The Fishery.”’ 
Journal of Political Economy, April 1954, 
62(2), pp. 124-42. 

Heyerdahl, Thor. The Kon-Tiki expedition. 
London: Allen and Unwin, 1950. 

. Easter Island: The mystery solved. 
London: Souvenir Press, 1989. 

Homer-Dixon, Thomas. ‘‘Environmental Scar- 
cities and Violent Conflict: Evidence from 
Cases.” International Security, Summer 
1994, 19(1), pp. 5—40. 

Howitt, Peter and McAfee, R. Preston. ‘“Stability 
of Equilibria with Externalities.’’ Quarterly 
Journal of Economics. May 1988; 103(2), 
pp. 261-77. 

Kirch, P. V. The evolution of the Polynesian 
chiefdoms. Cambridge: Cambridge Univer- 
sity Press, 1984. 

Lekson, Stephen H. and Cameron, Catherine M. 
“The Abandonment of Chaco Canyon, the 
Mesa Verde Migrations, and the Reorgani- 
zation of the Pueblo World.” Journal of 
Anthropological Archaeology, June 1995, 
14(2), pp. 184—202. 

Lee, Maw-Lin and Loschky, David J. ‘‘Malthusian 
Population Oscillations.’’ Economic Journal, 
September 1987, 97(387), pp. 727-39. 

Malthus, Thomas R. An essay on the theory of 
population. Oxford: Oxford University 
Press, 1798. 

May, Robert M. Stability and complexity in 
model ecosystems . Princeton, NJ: Princeton 
University Press, 1973. 

Neher, Philip. Natural resource economics: 
Conservation and exploitation. Cambridge: 
Cambridge University Press, 1990. 

Norgaard, Richard B. Development betrayed: 
The end of progress and a coevolutionary 
revisioning of the future. New York: Rout- 
ledge, 1994. 

O’Hara, Sarah L.; Street-Perrott, F. Alayne and 
Burt, Timothy P. “‘Accelerated Soil Erosion 
Around a Mexican Highland Lake Caused 
by Prehispanic Agriculture.” Nature, 
March 1993, 362(6415), pp. 48—50. 

Opsomer, Jean-Didier and Conrad, Jon M. ‘‘An 
Open-Access Analysis of the Northern An- 
chovy Fishery.” Journal of Environmental 
Economics and Management, July 1994, 
27(1), pp. 21-37. l 
































evolution of institutions for collective; 
tion. Cambridge: Cambridge Unive 
Press, 1990. wt, 

Ostrom, Elinor; Gardner, Roy and War. PN 
James. Rules, games, and common-pool » Bae 
sources. Ann Arbor, MI: University gf 
Michigan, Press, 1994. bots, fe 

Ricardo, David. Principles of political econ 
and taxation. (Reprinted) London: Deg $; 
1817. ž 

Ragozin, David L. and Brown, Gardner. ‘Hg 
vest Policies and Nonmarket Valuation; 
Predator-Prey System.” Journal of Enj} 
ronmental Economics and Managemen} a 
June 1985, 12(2), pp. 155-68. j 

Schaefer, M. B. ‘‘Some Considerations of P 
ulation Dynamics and Economics in Reh: 
tion to the Management of Marg 
Fisheries.” Journal of the Fisheries 
search Board of Canada, 1957, 14, 
669-81. 

Scott, Anthony D. and Southey, Clive, ‘ 
Problem of Achieving Efficient Regul} 
tion of a Fishery,” in Anthony D. Scoag 
ed., The economics of fishery manage} 
ment: A symposium. Vancouver, Cana 
Institute of Animal Resource Ecolo 
University of British Columbia, 1969, p 
47-59. : 

Smith, Vernon L. ‘“The Primitive Hunter Ut 
ture, Pleistocene Extinction, and the Rised 
Agriculture.” Journal of Political Ecos 
omy, August 1975, 83(4), pp. 727-55. : 

. “Economic Principles in the Emt 
gence of Humankind: Presidential Addré 
to the- Western Economic Associatio 
Economic Inquiry, January 1992, 30(Ihf 
pp. 1-13. ’ 

Tainter, Joseph. The collapse of complex 8? 
eties. Cambridge: Cambridge Univer 
Press, 1988. >” 

Van Tilberg, Jo Anne. Easter Island: Arc 
ology, ecology, and culture. London: BR 
ish Museum Press, 1994. oe 

Weiss, H.; Courty, M. A.; Wetterstrom, 
Guichard, F.; Senior, L.; Meadow, R 
Curnow, A. ‘‘The Genesis and Collap*<§ 
Third Millennium North Mesopotat 4 
Civilization.’’ Science, August Be 
261(5122), pp. 995—1004. 















Sources of Inefficiency in a Representative Democracy: A 
l Dynamic Analysis 


By TIMOTHY BESLEY AND STEPHEN -COATE * 


Ay This paper studies the efficiency of policy choice in representative democracies. 







































One of the crowning achievements of neo- 
- elassical economics is a rigorous appreciation 
f the performance of markets in allocating re- 
sources. However, for resources allocated in 
| the public sector, our understanding is much 
Name 35 complete.’ In part this reflects the lack of 
iony D. SOREA: 3 satisfactory theoretical framework to analyze 

mm policy choice in representative democracies. 
| Besley and Coate (1997), we introduced a 
møödel of democratic policy making as a can- 
idate framework for the systematic analysis 
these questions. In this model, policy au- 
is delegated to particular citizens and 


7, Clive. STi 


and the Risa 
>olitical Ec 
p. 727-554 
s in the bm 
jential Addn 
Associatioay 


Besley: Department of Economics, London School of 
v Eectontics, Houghton Street, London WC2A 2AE; Coate: 
nent of Economics, University of Pennsylvania, 
Iphia, PA 19104. The authors are grateful for com- 
A two anonymous referees, Avinash Dixit, Tim 

: Kenneth Koford, John Lott, George Mailath, 
Maria Milesi-Ferretti, and Stephen Morris, as well 
zous seminar participants. Besley is also grateful 
hospitality of the Research School of Social Sci- 
fit dah an National University, where his work on 
Despite 











of this Paper was completed. 
ite the vast Volume of research on public choice, 
Clear consensus on the ability of representative 
hes repe efficient outcomes. At one extreme 
as e Chicago tradition, such as George Stigler 
eyez Gary Becker (1985), and Donald Wittman 
face) argue that political competition will bring 
os t policy choices. At the other extreme are 
asya., rnanan, Gordon Tullock, and their followers 
: School’’), who see “‘political failures’’ as 
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It extends the citizen-candidate model of democratic policy-making to a dynamic 
- environment. Equilibrium policy choices are shown to be efficient in the sense 
“- that in each period, conditional on future policies being selected through the 
> > democratic process, there exists no alternative current policy choices which can 
™ raise the expected utilities of all citizens. However, policies that would be de- 
clared efficient by standard economic criteria are not necessarily adopted in 

` +" political equilibrium. The paper argues that these divergencies are legitimately 
-- 7 Yiewed as ‘‘political failures.’’ (JEL D61, D78, H11) 


individuals compete, through the electoral pro- 
cess, to acquire this power.” 

This paper uses the model to investigate the 
efficiency of equilibrium policy choices with 
repeated elections. Efficiency issues are then 
more subtle because preferences extend over 
the entire future policy sequence, while policy 
makers can control only what happens in their 
current term. We find that, while political 
equilibrium does satisfy a certain efficiency 
property, this does not imply that policies are 
efficient according to standard economic cri- 
teria. We also discuss whether the nonimple- 
mentation of ‘economically efficient policies 
can legitimately be viewed as ‘‘political fail- 
ure,” drawing the parallel with the market fail- 
ure literature. 

To analyze policy choices in a dynamic set- 
ting, we embed our model of representative 
democracy in a simple two-period economic 
model that incorporates redistribution and 
public investment. A single consumption good 
is produced using labor and citizens obtain 
utility from consumption and leisure, differing 
in their productive abilities. Redistributive tax- 
ation takes the form of a linear income taxas 
in Thomas Romer (1975) and Alan H. Meltzer 


2 A similar model, which shares the basic idea of view- 
ing candidates for political office as citizens, was intro- 
duced independently by Martin J. Osborne and Al 
Slivinski (1996). They coined the term ‘“‘citizen- 
candidates’’ to describe the approach. 
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and Scott F. Richard (1981). The public in- 
vestment, which can be thought of as infra- 
structure or education, enhances citizens’ 
second-period productivities.* In each period, 
the citizen selected to be policy maker chooses 
the parameters of the tax system. The first- 


period policy maker must also decide whether, 


to implement the public investment. 

We show that equilibrium policy choices are 
efficient in the following sense; in each period, 
conditional on future policies being selected 
through the democratic process, there exist no 
alternative current policy choices that can raise 
the expected utilities of all citizens. To show 
that this does not imply that equilibrium policy 
choices are efficient according to standard eco- 
nomic definitions, we focus on the public in- 
vestment decision. We identify three reasons 
why public investments that are potentially 
Pareto improving with the available policy in- 
struments may not be undertaken in political 
equilibrium. Each stems from the problem that 
when a society makes policy decisions via rep- 
resentative democracy it cannot commit to fu- 
ture policy outcomes. 

The first reason concerns nonpayment of fu- 
ture compensation. To generate a Pareto im- 
provement, a public investment may require 
some individuals to be compensated, via the 
tax and transfer system, after future gains are 
realized. However, compensation may not ac- 
tually be paid in political equilibrium, since 
future tax rates are determined by policy mak- 

_ers at that time. Thus, if future policy makers 
are expected to have different preferences, the 
incumbent may be deterred from undertaking 
efficient investments. 

The other two reasons apply to public in- 
vestments that do not require manipulation of 
future taxes and transfers to be Pareto improv- 
ing. These may not be undertaken if they 
change the identity of future policy makers in 
a way disadvantageous to the current incum- 
bent. This can happen when an investment, by 
altering citizens’ productive abilities, leads to 
changes in preferences for redistribution, .re- 
sulting in the election of a citizen with differ- 


* For reasons that are subsequently discussed, the prob- 
lems we identify are less likely to apply to public invest- 
ments that produce future consumption benefits. 
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- cause they can control how such resources are all 


































ent policy preferences. Public investments} 
change the choices of future policy mai 
may also not be undertaken. By changin 
income distribution, a public investmey 
alter the desired tax rates of future policy y ma: 
ers. Such changes can deter an incumbent. 
undertaking an investment. 
To assess whether the nonimplementatg§ 
of potentially Pareto-improving investmeng 
legitimately viewed as a “‘political failure’, 
quires us to take a stance on what constitu 
such a failure. In the interests of creatine, 
level playing field for markets and goveg 
ments, a sensible definition of political faily # 
should parallel that used in a market contey ¥ 
In this sense, we argue that failing to undertyh 
a public investment that is potentially Pang 
improving with the available policy ing 
ments does constitute a political failure.’ 


and Lars Svensson (1989) argue that ‘sbi Cilzens diff 
icy can stray from the efficient path in politi 


deficits to manipulate the choices of foak 
policy makers who do not share their poli 
preferences. This is one reason for polita 
failure in our model.* Philippe Aghion a} pni 


sE 


Patrick Bolton (1990) and Gian Maria Mile ao 
Ferretti and Enrico Spolaore (1994) deve &. 
models in which policy is distorted becats $ 
current policy choices affect which politi 
party will win in the future. This incentive 
also present in our model.® 


* This paper builds on the analysis of inefficiency is $: 
static model in Besley and Coate (1997), which shot g z 
how political competition could fail to secure the elec 

of the most competent policy makers. Imperfect i t 
mation can also generate inefficiencies as in Kem 
Rogoff (1990) and Coate and Stephen Morris ( 1985 

5 In a related argument, Guido Tabellini and 
Alesina (1990) show that, in an environment of poli a 
instability, those currently holding political power ha 4 
incentive to borrow from the future (i.e., run deficits) ® 


See also Amihai Glazer (1989). 

© Most existing models rest on incomplete app 
to the electoral process. Persson and Svensson (1 
fer no political model; they assume that current 5 
makers anticipate future policy makers with differed 
erences. Aghion and Bolton (1990) and Milesi Fa ; 
and Spolaore (1994) use a model of two parties W 
















soe 8 
E jte organization of the remainder of the pa- 
fe follows. The model is presented in Sec- 
N img 1, In Section II political equilibrium is 
a, wa and equilibrium policy choices are char- 
Ia. --ed, Section II analyzes the efficiency of 
ieee licy choices emerging from political equi- 
BS f ium, focusing largely on the investment de- 
ÜM — <1, Section IV discusses the interpretation of 
ts, while Section V concludes. 


Re gy: resul 
[3 p} 4 
hie I. The Model 














A. The Economic Environment 


Wy 4 
iy ' The economy consists of N citizens, indexed 
by i € N= {1, ..., N}, and lasts for two 
í periods, indexed by 7 € {1, 2}. There is a 
Æ single (nonstorable) consumption good, de- 
Š sted by x, that is produced using a single fac- 
"or, labor, denoted by £. Each citizen is 
: endowed with one unit of labor in each period. 
itizens differ in their productive abilities. If 
‘ eitizen i supplies £~; units of labor in period 7, 
: he produces x,; = a,;,;, where a,; is his period 
| ¢ “ability.” The consumption good is pro- 
$ duced competitively with wage rates being 
ae equal to abilities for all citizens. Citizen i’s 
$ period r utility is u(x,;, €n), where the com- 
< mon per period utility function u(-) is smooth, 
F itrictly quasi-concave, increasing in x, and 
E: &nincreasing in €. Second-period utility is 
f; at discounted. 
$! -At the beginning of each period, a policy 
k: maker selects the parameters of a linear in- 
: eoiie tax:” the tax rate t E [0, 1] and the guar- 
f antee T E€ R. With period 7 tax system (¢,, 
F T), a citizen with income y has a tax bill of 
fy.~.T,, The first-period policy maker must 





es of fut 
their poli 


























“genous policy preferences. They also do not explain how 
ae mcumbent party came to be in power. Tabellini and 
p: Alesina (1990 p. 39) assume that decisions are made by 
Seyority rule. However, they need to impose structure on 
Peia to guarantee the existence of a median voter 
llamas assume that the identity of the median voter 
xe or Over time, ‘‘due to: (i) random shocks to the costs 
:  -Foting that affect the participation -rate ... 
$ in the eligibility of the voting population.” 
This Could be motivated,.following Geoffrey Brennan 
anan (1980), as a constitutional constraint on 
© instruments. Our basic analytical method 
y on with any given set of redistributive 


lete approa 
sson (1989, 
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- also choose whether or not to undertake a dis-- 


crete public investment. This decision is de- 
noted by g, with g = 1 (0) meaning that the 
investment is (not) undertaken. The invest- 
ment costs C units of the consumption good 
but is productive, raising all individuals’ 
second-period abilities. Thus, for all citizens i, 
au = au (8) where ay (1) = az,(0).? 

Labor supply decisions are mäde~after the 
policy maker has selected the tax system. With 
tax system (t,, T,), citizen i will supply € (t, 
T,, a,i) units of labor, where £ (t, T, a) = arg 
max{u((1 — t)a£ + T, £): <€ € [0, 1]}. He 
will earn y(t,, T-, a,;) units of income, where 
y(t, T, a) = a€(t, T, a), and will enjoy a 
utility level v(t., T,, @,;), where v(t, T, a) = 
u((1 — t)y(t, T, a) + T, 2(t, T, a)). 

The policy maker’s tax and investment 


-choices must be feasible. In period one this 


amounts to satisfying the constraint: 


N 
(1) t È y(t, Ti, a) — NT, = Cg. 

i=) 
The left-hand side of (1) is net tax revenues 
and the right-hand side is public expenditures. 
Let Z, denote the set of period-one policy 
choices (t,, Ti, g) which satisfy (1). Since 
there are no public expenditures in period two, 
the policy maker’s feasibility constraint is 


N 
(2) t }, y(t, Ta, ax (8)) — NT: = 0. 


i=l 


Let Z (g) denote the set of period-two policy 
choices (f, T2) which satisfy (2), when the 
investment decision is g. 


B. The Political Process 


The model of policy-making is based on 
Besley and Coate (1997). In each period a 


8 The public investment may create incentives for in- 
dividuals to borrow from one another: a citizen who ex- 
~ pects ta.benefit greatly from the investment may gain from 
borrowing from a nonbeneficiary in the first period. To 
focus directly on the efficiency of public choices, how- 
ever, we avoid the complications associated with intro- 
ducing a private loan market. , 
? The Downsian model of policy-making with two-vote 
maximizing parties is difficult to apply to the environment 


t 


wa 
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member of the community is selected to make 
policy choices, with an election determining 
the choice of citizen to do this. All citizens are 
able to run in these elections and each must 
choose whether or not to declare himself as a 
candidate. Running for office is not costly.'° 
All individuals in the society then vote over 


* the’set of self-declared candidates. The can- 


didate with the most votes wins (there is plu- 
rality rule). In the event of ties, the winning 
candidate is chosen randomly with each tying 
candidate having an equal chance of being se- 
lected. If only one individual runs for office 
then he is automatically selected. If no citizen 
runs, taxes are zero and laissez-faire prevails. 


‘The winning candidate selects policy only in 


the period for which he is elected. 

The political process in each period is mod- 
eled as a three-stage game. Stage.one sees each 
citizen deciding whether or not to become a 
candidate. In stage two, citizens vote over the 
set of self-declared candidates. At stage three, 
the winning candidate chooses the policy for 
that period. In voting, citizens anticipate can- 
didates’ policy choices and vote accordingly. 
As potential candidates, citizens anticipate this 
voting behavior. The two periods are linked via 
the public investment decision, which affects 
citizens’ second-period abilities and hence their 
preferences over period-two policies. Citizens, 
as voters and policy makers, must anticipate the 
consequences of their first-period decisions for 
second-period outcomes.'! 

Viewed ex ante, the political process gen- 
erates a probability distribution over policy se- 
quences. A policy sequence is a pair {(t,, T), 
2), T2} consisting of a first-period policy 
choice (f, Tı, g) € Z, and a probability dis- 
tribution over second-period policy choices 


of this paper since the first-period policy choice involves 
both a tax rate and an investment decision. Without further 
restriction, this standard model typically fails to produce 
a prediction in two-dimensional models. 

10 The results of the paper would be unaffected by the 
assumption that running for office has a small cost (say, 
in the form of reduced consumption). 

'' The model assumes that citizens are farsighted. 
While assuming myopic voting is common in dynamic 
public choice models, it is not very satisfactory when 
questions of efficiency are on the agenda. For a useful 
review of the different ‘‘shortcuts’’ taken in dynamic pub- 
lic choice models, see Per Krusell et al. (1997). 
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TV, (CÈ; Ti, g), T2) denote citizen i’s ex 


























Tz : Za(g) > [0, 1]. With policy seg 
{(t1, 71,8), T2}, the first-period choice jg 
T,, g) and the second-period policy cho; 
(h, T2) E 2,(g) with probability z(t, 
The probability distribution 73 is generate 
‘the political equilibrium arising in periody $ 
when the public investment decision is g 8y 


utility under the policy sequence {(t,, T, 
T2}, defined as 


S mlh, T2) v(t, To, ax (g) 


ci 
(12,72) E A(T) = 


te ge 


where we use A(-) to denote the support 
probability distribution; i.e., the set of, 
comes which are selected with positi 
probability. : 


political decision-making in both periods, defs 
ing political equilibrium and characterizing ep 
librium policy sequences. We begin with ti 
second-period election and policy choice, taki 
as given the public investment decision. We ths &: 
analyze the first-period election and politj 
choice, recognizing that citizens as voters a 
elected policy makers will anticipate the depe§ 
dence of second-period policy choices on & 
public investment decision. z 


A. Period-Two Election and Policy Choice ; 


We work through the three stages of the? 
litical process in reverse order, beginning wit 
the policy-selection stage.'? The citizen WHY 
wins the period-two election will implens 
his preferred period-two policy —promis¢s 


1? Using a probability distribution over period-twort 
icy choices captures any randomness in political out]. 3 
stemming, for example, from the possibility of ties. 1 
same randomness is possible in period one, which i% 
the political process generates a probability distrib 
over policy sequences. 

'3 The treatment will be concise and the readef 
ferred to Besley and Coate (1997) for further de! 

























Sog NO. | 


yi . ae 3 
+1 sherwise are not credible. Citizen i’s pre- 
jo P licy is given by: 


a (ty; (g)» Ta (8)) 
= = arg max { v(t, T2, 2:(g)) | (t, T2) 


EZ,(g)}. 


We assume that the solution to (4) is unique.'* 
-aciated with each citizen’ s election, there- 
fore; is a utility imputation (v3 (8g), ..., 
gix(g)), where v2;(8) = v(ti(g), Tri (8), 
4 a;( g)) is individual j’s period-two utility if i 
x jse 
| Seeond-period policy is (tao(g), Tao(g)) = (0, 
0), We denote the utility imputation in this 
“pag as (U (Eira t VCE): 

E .. We now tum to the voting stage. Let the set 
K of candidates be C C Nand let ay E C U 
B {0} denote citizen j’s decision, where a; = i 
k: denotes j voting for candidate i, and a, = 0 
denotes abstention. A vector of voting deci- 
sions is @ = (Q21, ..., Qan). Given C and a, 
B: let P‘(C, œ) denote the probability that can- 
didate i wins. Under the assumption of plural- 
ity rule, P‘(C, œ) equals 1 if i has the most 





ws 


choice, takig Si votes or is the only candidate. It equals 1/M, 
‘ision. We the : if there are M tying candidates of which i is 
m and polig m. Ge. It is zero otherwise. 

as voters E:  [ndividuals vote to maximize their expected 
vate the d k utility given the voting decisions of others. A 


H 


'hoices on SEE period-two voting equilibrium given g is a vec- 

: A tar of decisions a} such that, for each citizen 

4 EN: (i) a% is a best response to æž_ ;, i.e., 
Wy * 


(8) at ¢ argmax{ SPUC @pats) 
* oe iec 


X v2;(g) lax E CU wo}, 


er period-two 
political oul aa 
bility of tics aa 
one, which Js) 

ibility distor 






While itis highly unlikely that (4) will have multiple 
sy? It is difficult to come up with a set of simple 
Conditions to rule it out. The conditions used by 
and Richard (1981) to guarantee the inverse re- 
een tax rates and productive ability (u(-) 








4 the readef, 
further deta 
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and (ii) a3; is not a weakly dominated voting ` 
strategy. Ruling out weakly dominated strate- 
gies is standard in the voting literature and im- 
plies that citizens do not vote for their least 
preferred candidate. Hence voting is sincere in 
two-candidate elections. 

_We are now ready to describe the entry 
stage. Citizens run for office to move policy 
in their preferred direction, with their ability 
to do so dependent on who else enters. Let 
Sz; © {0, 1} denote citizen j’s pure strategy, 
where sy; = 1 denotes entry. Let s) = (52, ..., 
Sov) denote a pure strategy profile. The set of 
period-two candidates with strategy profile s, 
is C(s2) = {ilsa = 1}. 

Each citizen’s expected payoff from any 
given strategy profile depends upon antici- 
pated voting behavior. We let a,(C) be the 
voting behavior citizens anticipate when the 
candidate set is C. Thus, if citizen i enters then 
he/she anticipates winning with probability 
P'(C(s2), œ(C(s2))). The expected payoff 
to any citizen j given s, is therefore 


(6) Uy (S238, a2(-)) 
= 5 P'(C(sz), @2(C(s2)))v5(g) 


iE C(s2) 


+ P°( C(s,))v3;(g), 


where P°( C(s,)) denotes the probability that 
the default outcome is selected. This equals 
one if C(s2) = Ø and zero otherwise. Citizen 
j’s payoff is thus the probability that each can- 
didate i wins in the race times his payoff from 
i’s preferred policy. 

To ensure the existence of an equilibrium, 
we need to allow for mixed strategies. Let yy 
(€ [0, 1]) be a mixed strategy for citizen j, 
with a mixed-strategy profile being denoted by 
Yo = (Ya, +> Yew) and citizen j’s expected 
payoff under y, being denoted by w(7: 8, 
a,(-)). An equilibrium of the period-two elec- 
tion given g is a vector of entry decisions Yz 
and a voting function a(-) such that: (i) for 
each citizen j, yy is a best response to Y2_; 


ee ne a Se 
is strictly concave and satisfies the Inada conditions and 
both consumption and leisure are normal goods) are cer- 
tainly sufficient, but much stronger than necessary. 
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given g and @,(-), and (ii) for all nonempty 
candidate sets C C N, œ( C) is a voting equi- 
librium given g. 

Associated with any such equilibrium (72, 
@(-)) are two probability distributions. First, 
a distribution over the citizen who chooses 
period-two policy denoted by r2(y2, @2(-)), 
where r(Y, @2(-))(7) is the probability that 
citizen i will be the policy maker and 7,(72, 
@,(-))(0) the probability that nobody runs. 
Second, a distribution over second-period pol- 
icies, denoted by 72( 72, @2(-), g), where 
T(Y2, @2(-), 8) (to, T2) is the probability that 
the second-period policy is (t,, T2). These 
probability distributions are related by 


(7) T2(Y2, @2(-), 8) (te, T2) 
= > 


JE LIE NUO: (8) Ta8) =T) 
B. Period-One Election and Policy Choice 


Citizens look forward in making their first- 
period decisions. If they anticipate an equilib- 
rium of the period-two election (y$, @5(-)) 
when the investment decision is g € {0, 1}, 
then the probability distribution over second- 
period policy choices is expected to be 73 = 
m(¥3, @3(-), g), with T, = (72, 72). If 
elected as first-period policy maker, citizen i 
will select period-one policy 


(8) (tAk), Te AL), g:Wh)) 
= arg max { V(t, Ti, g), 73) | 


(h, T,, 8) € Z}, 


given Il,. We again assume that the solution 
to (8) is unique. Associated with each citizen’s 
election is a utility imputation (vi, (T2), . 
vix(Il2)), where vyh) = 
Tu (IL), g; (IL); r3) is individual j j’s period- 
one expected utility if i is elected. If 
no citizen stands for office, the first-period pol- 
icy is (ño (Il2), Tio(Ih), gL) = (0, 0, a 
and the utility imputation is (vfi (IL), 
v în (A)). 

The period-one voting stage parallels that in 
period two. Let the set of candidates be Ce 
N and let a; E C U {0} denote j’s voting 


12 (Y2,02(-)) (J). . 


V;( tu (L), ; 


_ is one which could potentially arise in po 




























decision. A vector of voting decisions q 


Penod P 
(až, -. @jy) is a period-one voting eoni koari a 
rium, given Il, if for each | citizen j © jig Rien o od 
at; is a best response to af_ j, and (ii) g cal e 
not a weakly dominated voting strategy, 1 ion 


Turning to candidate entry, sy € {0, 1} 
each citizen’s pure strategy, with œ; (-) 
the (common ) function describing anticipaj } 
voting behavior. The payoff to any citizey:# 
given his own entry decision and those of Oth | 
citizens is given by 


(9) U,;(s;; Th, ai(-)) 
= 5 P'(C(s;), a1 (C(s,))) vi E 


ie Cisy 
+ P°( C(si))v9 (Th) : 


Let yı; be a mixed strategy for citizen j andi 
y = (yu ..., Yin) be a mixed-strategy profik 
Citizen j’s expected payoff under the mixed. $ i 
strategy profile y, is denoted w; (Y, Mn five suci 
&()). ‘ 

An equilibrium of the period-one election $ 
given II, is a vector of entry decisions y, a 
a voting function @,(-) such that: (i) for ead 
citizen j, yı; is a best response to y, -j giv? 
II, and (ii) for all nonempty candidate si $ 
CC N, a;,(C) is a voting equilibrium giv } 
Il. Associated with any equilibrium (4s 
a,(-)) are probability distributions over fit } 
period policy makers (r, (Y;, @(-))) andon} 
first-period policies (7, (yı, @:(-), Th)). 





C. Political Equilibrium and Equilibrium } 
Policy Sequences — $ 


A political equilibrium is a triple {( 
@,(-)), (Y2, @3(- Ye e (0.1; } Such that: i 
@ l )) is an equilibrium of the _ period: 


over second-period policies implied by 
a3(- Ve € (0.1}> and (ii) (Y3, a3(- )) is an 
librium of the period-two election given sf 
ge {0, 1}. An equilibrium policy seque’ 


equilibrium. More precisely, {(t*, Ti: 

m*} is an equilibrium policy sequence i 
se a political equilibrium q (Hi ; ni 
(y3 @2(- ))e € 10,1; } such that (¢7, TY „g“ oes 
in the support of the probability distribution jaa d 








































poe 88 NO 
i licies generated by that political 
garperiod ae a* is the probability distribution 
sr 'second-period policy choices generated by 
over Jitical equilibrium when the public invest- 
=t, decision is g*. It is straightforward to show 
a political equilibrium, and hence an equi- 
ghrium policy sequence, exists.” 


m. Analysis of Equilibrium Policy Sequences 

; This section investigates the properties of 
` he policy sequences emerging from political 
‘amailibrium. We begin by noting that a certain 
| pe of efficiency property follows immedi- 
gely from the fact that policy makers are 
Ș. mility-maximizing citizens. 


: PROPOSITION 1: Let {(t*, Ti, ¢*), 73} 


ke an equilibrium policy sequence and let 
te 8g 


{y a,(-)), (Y3, @3(-))ee(o.1)} be the as- 
sociated political equilibrium. Then, (i) if 
74 = | for some j © N, there is no (t,, Ti, 
; g) € 2, such that 


a triple (age: 
'h that: (i EY 


gaPosition 1 of Besley and Coate (1997) implies 
-= “exists at least one equilibrium of the period-two 
for each possible level of g. Any ‘pair of ‘such 
(one for both levels of g) generates a pair of 
distributions over second-period policy 
icy =! sition 1 of Besley and Coate (1997) also 
rise in a poe exists an equilibrium of the period-one 
( i , Tij £ ‘ any pair of probability distributions over 
quence if in oe? choices. Hence, there must exist a 
(Cy ag A num. Existence of equilibrium in this type 
RARAS Kee erandidate” model is not problematic because 
TTi Idate: face a discrete ‘‘in-or-out’’ decision, 
ton about which platform to adopt. 


listributiong 
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Part (i) says that if at least one citizen enters 
the first-period election with probability one 
then, conditional on second-period policy 
choices being chosen through the democratic 
process and the equilibria of the secorid-period-. 
election being (73, @3(+)), e (0,1), there exists 
no alternative first-period policy choice which 
gives every citizen more expected utility. This 
follows from the fact that the first-period pol- 
icy choice is optimal for some first-period pol- 
icy maker. Part (ii) says that there is no 
alternative second-period policy choice which, 
given the public investment decision, gives 
every citizen more second-period utility. The 
caveat that at least one citizen run is not nec- 
essary, since the default outcome is Pareto ef- 
ficient in the second period. 

Proposition 1 parallels a result (Proposition 
10) in Besley and Coate (1997), and is a nat- 
ural consequence of including the policy 
maker’s utility in assessing efficiency. How- 
ever, it does not imply that equilibrium policy 
sequences are efficient in the usual econo- 
mists’ sense. Standard notions of economic 
efficiency are defined without reference to the 
policy-making process, taking no account of 
the institutional constraints that shape actual 
decisions. In this vein, a policy sequence is 
efficient if there exists no other technologi- 
cally feasible policy sequence which makes 
all citizens better off. There is no guarantee 
that policy sequences are efficient in this 
sense. 

We illustrate this by focusing on the public 
investment decision. We suppose that the pub- 
lic investment is potentially Pareto improving 
in the sense that for any policy sequence { (4, 
Tı, 0), m2} there exists an alternative policy 
sequence {(t{, Ti, 1), 74} such that V;((t1, 
Ti, 1), 73) > Vi((f, Ti, 0), 72) for alli € 
N. We will then identify three reasons why 
the investment might*not be undertaken in po- 
litical equilibrium. 

We bégin with a benchmark result which 
establishes a sufficient condition for the in- 
vestment to be implemented in political 
equilibrium. 


PROPOSITION 2: Let {(t*, T7, 9*), 77} 
be an equilibrium policy sequence and let 
{Cy @:(-)), (73, @2(-))g € (01) } be the as- 
sociated political equilibrium. Suppose that 
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for all j € A(rı(yı; &())) and k € 
Alra(y2 æl D) 


(1), To(1)) = C), Ty(1)). 


Then, if y; = 1 for some j E Nand the public 
investment is potentially Pareto POE: 
gt =], = 


PROOF: 
See Appendix. 


The key condition is that all first-period pol- 
icy makers anticipate that, were they to un- 
dertake the investment, the second-period 
policy maker would choose exactly the same 
policy as they would. If the investment is po- 
tentially Pareto improving, there exists (by def- 
inition) a way of redistributing the gains so 
that all individuals benefit. Thus a policy 
maker who knows that someone who shares 
his policy preferences will govern next period 
is sure that the redistribution, as needed to 
make the investment worthwhile for him, will 
be forthcoming. The following example shows 
how political instability can lead the current 
incumbent to forgo the investment. 


Example 1: Suppose that the population is 
equally divided into two ability types—low 
and high, Let N; denote the set of low-ability 
individuals and Iy the set of high-ability in- 
dividuals. Assume that individuals care only 
about their consumption and are risk neutral, 
so that u(x, €) = x. Let a, denote the first- 
period ability of low-ability individuals and ay, 
(>a) that of high individuals. If the invest- 
ment is not undertaken, individuals’ second- 
period abilities equal those in the first period. 
If it is undertaken, then it raises the ability of 
high-ability individuals. Formally, a.;(1) = 
ay + ô fori E Ny, while ay (1) = a, fori € 
N. We assume that 6/2 > C/N, which im- 
plies that the public investment is. potentially 
Pareto improving. 
We now demonstrate the existence of an 
- equilibrium policy sequence in which the in- 
vestment is not undertaken. We begin by de- 
scribing the equilibria of the second-period 
election. Note first that if a high-ability citizen 
is elected to office in period two, he will choose 
zero taxation irrespective of the public invest- 
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-sion, therefore, there is an equilibrium of 

































ment decision, implying that (ty,(1), Tay 
(ta7(0), To#(0)) = (0,0). A low-abiliye = 
dividual will choose a tax rate of 100 pe 
and hence (f%;(1), Ta (1) = (1, @ 4; 
and (t2,(0), Ta (0)) = (1, @), where @ = 
ay)/2 is the mean first-period ability. 

Irrespective of the public investment ç 


second-period election in which a row 
individual runs against a high-ability individ. 
ual and all citizens vote for candidates ys, tee 
share their ability. (It is simple to check ty Fame, 
entry is worthwhile and that no other candidg 
will enter.) In this equilibrium, each Candidy 
is elected with probability '/,. When g =0 
probability distribution over second-perg 
policies generated by this equilibrium seleg 
(1, @) with probability '/, and (0, 0) with pro 
ability '⁄. When g = 1, the probability dist 
bution selects (1, Æ + 6/2) with probabilij 
' and (0, 0) with probability '⁄. 

Turning to the period-one election, obs 
that a low-ability citizen will set a tax ratéd 
100 percent if he is elected; the only questig 
is whether he will undertake the pana: 


investment if 6/4 < C/N. Under this assur É 
tion, therefore, a low- -ability citizen will seled &- 
the first-period policy (1, g, 0). A high-abiliy & 
citizen would always set a zero tax rate at All 
would always undertake the investment = eiii 
thereby selecting (0, —C/N, 1) as the fim sé 
period policy. It “follows that there is an eqé 
librium of the period-one election in whichl¥; 
single low-ability individual runs against ast $ 
gle high-ability individual and all citizens vot $% 
for candidates who share their ability. In Wg 
equilibrium, each candidate is elected 
probability '/. Assuming that 6/4 < C/N 
probability distribution over first-period 
cies generated by this equilibrium selects (d 
@, 0) with probability '⁄4 and (0, -CIN 
with probability '⁄. 
Hence, if 6/4 < C/N, there is an e 
rium policy sequence > involving the period- 
policy choice (1, @, 0) and a a 


lecting (1, @) with ‘probability i and (Wn 
with probability 'h. 5 


In this example, the investment harms 
first-period policy maker in period 09%, 






































` period-two compensation if he is to be 
ae to undertake it. Such compensation 
‘forthcoming if an individual with sim- 
ferences is elected to office, but not 
. The possibility that an individual 
= nosing preferences will be elected thus 
T pat undertaking the investment. 6 
this way, political turnover can induce pol- 
makers to resist making public investments 
"shieh entail short-run costs, if the future gains 
ntingent on actions of future policy mak- 
This is a general problem which is likely 
crop up in many public decisions, Any pol- 
‘with short-run costs and long-run benefits 
phich do not accrue uniformly may not be un- 
deitaken for this reason. 
*. -We now turn to public investments where, 
ënlike in this example, compensations by ma- 
H sipalating taxes and transfers are not neces- 
i to achieve Pareto gains—all citizens 
nefit from the.investment, provided only 
that the tax rate does not increase and the guar- 
afitee-does not decrease. In pursuit of a formal 
finition, we first develop the idea of one 
obability distribution over second-period 
introdučé MEE palicies dominating another because it in- 
this assuma Valves lower tax rates and higher guarantees. 
zen will seem Thus, for any (2, T2) € Z,(0), let W(t), T2) 
A high-abii denote the set of second-period policy pairs in 
) tax rate 23('1), for which the income guarantee is at 
ast:as high and the tax rate no higher. We 
that the probability distribution m} : Z, 
) =? 8, dominates the probability distribu- 
Ta : 2,(0) > R, if: (i) for every (¢4, 
3) € Alri) there exists (t, T2) € A(m) 
ch that (13, 73) € W(t), T2), and (ii) for all 
i € A(T), 


o check 


er candid 


K 


ch candidg 


tion, obser 
a tax rat 

only quest 

investmén ý 


are is anog 
on in wha 


z m3(t3, T2). 


(13,73) € W(1,79) 


eke first part of the, definition says that 
my -Second-period policy pair selected by 


‘lated argument is made by Avinash Dixit and 
ieis (1995), who argue that efficient private- 

Anons can be deterred because of the inability 
Parties to commit to let individuals suffer eco- 
s and/or keep economic gains. 
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73 involves a lower tax rate and a higher guar- 
antee than some policy pair selected by m. - 
The second says that, for every second-period 
policy pair selected by 7, the dominant dis- 
tribution 74 selects a policy pair-involving a 
lower tax rate and a higher guarantee with 
equal or greater probability. If the investment 
is undertaken in period one and the associated 
probability distribution over second-period 
policies dominates that which would have 
occurred absent the investment, the expected 
utility of'any citizen i in period two cannot be 
lowered. As a final preliminary, let Z}(g) 
(C 2Z,(g)) denote the set of period-two pol- 
icy choices which are not strictly Pareto 
dominated.” 

Now we define the public investment to be 
straightforwardly Pareto improving if for 
every policy sequence { (4, Ti, 0), m2} with 
A(m2) C ZF (0) there exists (t/, T/, 1) E€ Z, 
such that if m4 dominates mz, then V;((ti, 
Ti, 1), 72) > ViC(t), Ti, 0), m2) for all i € 
N. This definition requires that for any given 
policy sequence in which the investment is not 
undertaken, there exists a way of financing the 
investment which makes all citizens better off 
for all dominating probability distributions 
over second-period policy pairs.'* The require- 
ment that the investment be Pareto improving 
for every dominating distribution is the differ- 
ence with the earlier definition and embodies 
the idea of no redistribution being necessary 
to achieve an improvement. 

In Example 1 the investment was not 
straightforwardly Pareto improving. Consider 
the policy sequence involving a first-period 
policy choice (1, a, 0) and a second-period 
policy choice (0, 0) with probability one. Any 
dominating probability distribution over Z,(1) 
must also select (0, 0) with probability one. 


1 Thus, (f2, T2) € Z7 (g)if and only if there does not ; . 
exist (t3, 74) € Z2(g) such that v(13, T3, ay (8)) > v(t, 
Ta, Q2; (g)) for all i € N. 

'8 The definition also restricts comparisons to feasible 
policy sequences which select only Pareto-undominated 
tax rates in period two [i.e., which satisfy Alm) C 
Z*(0)]. This, for example, rules out comparisons with 
policy sequences which set t so high that individuals are 
deterred from supplying any labor in period two. In that 
case, no investment, no matter how much it enhanced pro- 
ductivity, could satisfy the requirements of the definition. 
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Hence, there is no way of financing the in- 
vestment in the first period to make low-ability 
individuals better off for all dominating 
probability distributions. 
It would seem more hopeful that straight- 
forwardly Pareto-improving investments 
would be chosen in political equilibrium. The 
-next result gives some conditions for this to be 
the case. 


PROPOSITION 3: Let {(t¥, T, g*), 77} 
be an equilibrium policy sequence and let 
[Ys @(-)), (¥3, a3(*))¢ € 10,1) } be the as- 
sociated political equilibrium. Suppose that: 


o 
rn(y3, @3(-)) = m(y2, @2(-)), 
and (ii) for all i € A(r2(¥3, a4(-))); 
(tu (1), Ta (1)) € W(t: (0), Ta (0)). 


Then, if yy = 1 for some j E Nand the public 
investment is straightforwardly Pareto im- 
proving, g* = 1. 


PROOF: 
See Appendix. 


This proposition weakens the condition of 
complete political stability from Proposition 2, 
requiring only that the investment does not af- 
fect the probability distribution over future 
policy makers or cause them to select a policy 
pair involving a higher tax rate or lower guar- 
antee. Both of these conditions were satisfied 
in Example 1. Nonetheless, these conditions 
are strong. Our next two examples illustrate 
cases where conditions (1) and (11) of the prop- 
osition fail and straightforwardly Pareto- 
improving investments are not undertaken. 
The examples are very stylized, intending to 
give pure cases of each violation. 


Example 2: The population is divided into 
three groups of individuals: high types, low 


types and movers. Let- NV, denote the set of - - 


high types; N, the set of low types and Ny 
the set of movers. Assume that #N, + 


#Ny = #Ny and that #Ny, < N/3. Prefer- 


ences are u(x, £) = x. High and low types 
have abilities a, and a, in both periods, where 































ay > a,. Thus, neither of these groups reg 
a direct benefit from the public inveg, 
Movers have ability a, in the first period 
if the investment is undertaken, have ; 
a, + 6 in period two. We assume that the 2:38 
to the movers is such that N(a, — a,)/2(N 
#Ny) < 6 < 3(ay — a,)/4. In addition: 
suppose that the investment is costless (¢ : 
0), which implies that the investment ls 
straightforwardly Pareto improving.” - z643 
Again we will demonstrate the existe 
an equilibrium policy sequence in which # 
investment is not undertaken. Beginning yi 
period two, a low-type policy maker would 
a tax rate of 100 percent whether or not} 
investment is undertaken, i.e., (ty 
Ta (0)) = C1, @) and (t(1), Ta (t): 
a + (#Ny/N):5) where &@ = (a, + ay) 
high-ability policy maker would choose 
taxation, so that (f(g), T2u(g)) = (0, 
g € {0, 1}. If the investment is not une& 
taken, the movers share low types’ preferi i Gaitsriak 
and hence (tm(0), Tow(0)) = (1, @). Hore $A 
ever, if the first-period investment is mk% apdidate equ 
then under our assumptions on 6, movers é} eh j 
not want any redistributive taxation implyigh te 
that (tm(1), Tow(1)) = (0, 0). j 
It follows that if the investment is HÈ 
undertaken, there is an equilibrium of & 
second-period election in which a low-abili 
candidate and a high-ability candidate tie. Ts 
low-ability candidate is supported by b% 
low-ability citizens and movers, while hipg 
ability citizens vote for the high-abilij 
candidate. The probability distribution 0*% 
second-period policies is (1, Æ) with prob 
ity '⁄ and (0, 0) with probability '/. Whe 
investment is undertaken, however, there 
equilibrium of the second-period electi0 


1 This is not quite accurate, since when dealing 
policy sequence {(t,, Ti, 0), 72} such that 72 sel 
0) with probability one, it is only possible to find 
Ti, 1) such that {(t{, Ti, 1), 74} weakly Pareto% 
nates {(t,, Ti, 0), 72} if 74 dominates nz. This is a 
if 74 dominates 72, it must be the case that 14 also 
(0; 0) with probability one. Thus the high- and lo¥ 
types cannot both be made strictly better off, beca 
do not share any of the benefits of the investment. 
ever, if we allowed the investment to also give a Y 
gain to the high- and low-ability citizens, then the ™ 
tion would be satisfied even in this case and the ® 
would be unchanged. 

































NO.1 B 


‘a single high-ability candidate runs un- 
od. No mover has an incentive to enter, 
movers have the same policy preferences 


have ; < n ahility citizens. Low-ability citizens 
> that th hi ee sure if they entered. The prob- 
— a)(i “ery distribution över second-period policies ~- 
additiog ay by this equilibrium selects (0, 0) 
ostless (Ge probability 1. 
avestment Joperiod one, a low-ability policy maker 
ing.” Zogld set a tax rate of 100 percent; the only 


n is whether he would undertake the 
ent. This depends on how it affects low 


eginnin spes’ -second-period utility. If the investment 
aker would ‘undertaken, a low type’s expected second- 
aer or not od payoff is a,, while if it is not under- 

e, (t taken, this payoff is (3a, + ay)/4. Thus a 
Ta (1)) = laweability policy maker would choose (1, a, 
1, + Gn) 2AM Gy °A high-ability policy maker would select 
d choose: 79: 0, 1), choosing no taxation and undertak- 
)) = g the investment. A mover would select (1, 

is not UNM, 1), setting a tax rate of 100 percent and also 
23° preferenaam, gndertaking the investment. 

(1, a). 19 E -it is now straightforward to find a two- 
nent 1s many didate equilibrium of the first-period elec- 
6, movers MF tien involving a low type running against a 
ation implyt h type. The low types and movers vote for 
- k. è low-ability candidate, while the high- 
stment 1S ability voters vote for the high-ability candi- 
librium of è. Anticipated voting behavior is such that 
h a low-abl@@if a mover entered the race, the high-ability 
ididate tie idate would be expected to win. To verify 
orted by® il: movers prefer the low type to the high 
rs, while hi ê; note that with a high type, a mover’s life- 
e high- méütility is 2a, + 6, while under a low- 
stribution Hillty type it is 3a,,/4 + 5a,/4. The former is 
Bate Pai _ t-than the latter under our assumption 
ty ho. tz 
aa theré 1s 
iod electio 
when dealin a ; 

h that 72 S&U ‘This example illustrates the difference between our 
ssible to fi or of the Political process and the Downsian frame- 
reakly Pare! ving two competing parties offer the first-period .- 

s 12. This is +@, 0) and (0, 0, 1) would be inconsistent with 

that 73 The party offering the platform (0, 0, 1) 
igh- and low Its policy to attract the votes of the movers. 
ter off, becat. while the high-type candidate may have an 


ie investment, i {0 promise more redistribution in the first period, 
are not credible. The Downsian model cir- 
culty by assuming that parties just care 


#8 and thus can commit to implement any plat- 
Se. 
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Hence, there exists an equilibrium policy se- 
quence with period-one policy choice (1, a, 
0) and a probability distribution over second- 
period policies selecting (1, @) with probabil- 
ity '/, and (0, 0) with probability '⁄. 


In this example, the first-period policy 
maker does not undertake the investment, even 
though nobody apparently loses from it, be- 
cause it makes certain a loss of control. The 
beneficiaries of the investment switch alle- 
giance and ‘‘gang up’’ on their former politi- 
cal allies, eliminating the possibility that a 
citizen who shares the first-period policy 
maker’s preferences will be elected in period 
two. In this way, condition (i) of Proposition 
3 is violated. The basic idea that current policy 
makers may distort current policies to influ- 
ence the identity of future policy makers 
would appear quite general, with the papers by 
Aghion and Bolton (1990) and Milesi-Ferretti 
and Spolaore (1994) representing other pos- 
sible applications. 


Example 3: Suppose that the population is di- 
vided into three groups: poor, middle class, 
and rich. Let Ne, Ny, Np denote the sets of 
rich, middle class, and poor individuals, re- 
spectively. Suppose that the number of poor 
citizens equals the number of rich and middle 
class citizens: i.e., #Np = #INy + #Ng. As- 
sume that individuals’ preferences take the 
form: u(x, l} = x — 17/2. Let’a, denote the 
first-period ability of individuals in social class 
J (J € {P, M, R}) and assume ap < ay < 
ar. The public investment raises each middle 
class individual’s ability by an amount ô € (0, 
ar — Ay), but does not affect poor or rich in- 
dividuals’ abilities. The public investment is 
again costless, which implies that it is straight- 
forwardly Pareto improving. 

We demonstrate the existence of a political 
equilibrium in which-the investment is not un- 
dertaken. To solve for the second-period pol- 
icies which would be-selected by each type of 
citizen, observe that with this utility function, 
labor supply, and hence earnings, depend only 
on t. Hence, if (t, T2) € Z(e), then T, = 
b'l, g) where y(t, g) is mean second- 
period earnings when the tax rate is t, and the 
investment decision is g. The optimization in 
(4) is thus simplified to t; (g) = arg max v(t, 
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ty V(b, 2), a;(g)). Using the envelope the- 
orem, we obtain the following first-order 
condition: 


Y(t, 8) — Y (bi, ax (8)) 


+ tu OV( ti, 8)/ Ob = 0 (=0 


Since y(t, a) = (1 — t)a?, the optimal tax 
rates of the three types of citizens can be 
solved for as 


if t; > 0). 


pe 2 
tig) = max( 0 A(g) — an(8) 


” (2A(g) — au(8)°) 


JEe{P,M,R}, 


where A (8) = [#Npaop(g)* + #Nyam(8g)? + 
#Npdor(g)*]/N. Thus a poor citizen chooses 
a positive tax rate, while a rich individual 
chooses zero taxation. A middle class individ- 
ual might choose a zero or positive tax rate 
depending on the parameters. 

Irrespective of the public investment decision, 
there is an equilibrium of the second-period elec- 
tion in which a poor individual runs and ties with 
a middle class individual. Poor citizens vote for 
the poor candidate, and the middle class candi- 
date is supported by the rich and middle class. 
Anticipated voting behavior is such that in a 
three-way race between one candidate from each 
group, either the poor candidate would win or 
the poor and middle class candidate would tie, 
thus ensuring that no rich citizen has an incentive 
to enter. When the public investment decision is 
g, the probability distribution over second- 
period policies generated by this equilibrium se- 
lects (tam(g), tım (8)' ¥(taw(g), @)) with 
probability '/ and (tp(g), tp(g) ¥(te(g), 
g)) with probability '/. 

In the period-one election, observe first that, 
since the investment is not costly, (tiz, Tu) = 
(t2,(0), T2,(0)) for all types J € {P, M, R}. 
Whether the investment is undertaken depends 
on its effects on period-two policy choices. By 
raising the income-generating abilities of the 
middle class, the investment raises mean 
second-period earnings at any positive tax 
rate. This implies that any given tax rate pro- 
duces a higher guarantee. It also raises the tax 


first-period policy maker would forgo! 






























rate that would be chosen by a poor canis 
and, provided that thy(0) > 0, reduceg amie 
chosen by a middle class candidate. 5 

The middle class benefits directly fro 
investment in period two. While there 
to the middle class in terms of a higher tay 
if a poor candidate is elected, it can reagjy, 
shown-that this is never sufficient to outw 
the direct benefits of the investment. Th 
middle class policy maker would alwa 
vest. A rich policy maker must weigh 
increase in the tax rate chosen by a seck 
period poor candidate against the benefi; 
the higher mean second-period earning 
any reduction in the tax rate chosen by a 
dle class candidate. A poor policy m 
weighs up the higher guarantee if the poora% 
didate is elected against a (possibly) 
guarantee if the middle class candid gear 
1) < faw(0)-F(taw(0), 0)). If this i inequal 
is not satisfied, then the investment is ung 
biguously desirable from the viewpoint d 
poor policy maker. 


a rich candidate would wish to undertaké i 
When #Ny/N is small, the gains to the peg 


#Nu/N is small the costs in terms of ane us 

crease in the poor candidate’s optimal s 
period tax rate become small, but the bente 
in terms of a reduction in the optimal tax 
of the middle class remain. This yields the 
lowing result. 


LEMMA 1: Suppose that (az + ak 
ai,. Then there exists Å > 0 such thata 


vestment and a rich first-period policy 
would undertake it if#Ny/IN<). ` 


PROOF: 
See Appendix. 


Thus, if #Ny/N is small, there is 
librium of the first-period election invol 
middle class candidate running against 4 
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; The rich and middle class vote for 
reduc dle class candidate, while the poor vote 
or candidate. Each candidate is 


tly fr <4 with probability 14. The probability 
here is’ “bution over first-period policies gener- 
gher tay by this equilibrium selects (t(0), 
an readilie -F(t (0), 0), 1) with probability '/ 
: tO outy ad (t(0), te (0): ¥(t2p(0), 0), 0) with 
lent obability '/. In this case, there is an equilib- 


a policy sequence with period-one policy 
ee (be(0), top(0): ¥(t2p(0), 0), 0) and 


ind-period choices of (t2p(0), trp(0)- 


ie benef = to(0), 0)) and (tu (0), bmul0): Y(tm(0), 
earnings wa ) with equal probability. 

sen by aing ay 

olicy kin this example, the first-period policy 
‘the p B: mäker does not undertake the investment be- 
ssibly) lowe cgnse.it changes the policy preferences of a 
candidat W gature policy maker in a way that is detri- 
1)- FC Hg atal to the incumbent. Specifically, it in- 
his ineqi g a future policy maker to prefer a lower 
aent is zuarantee; which violates condition (ii) 


‘Proposition 2: Again, the basic idea that 
eurent policies might be manipulated to influ- 
gace the choices of future leaders would ap- 
pear quite general, with the paper by Persson 


avestmetitagmme 22d Svensson (1989) representing another 


» undertake cation. 

ns to thë De gether, the three examples reflect the im- 
n earnings ance of future policy for current choices. 
f less re ‘first example, future compensation is 
7 maker, ed to make the investment worthwhile 
arms of af Et incumbent. In the second and third, the 
ptimal seca endogeneity of future political outcomes 
rut the berg saves the results. In these latter examples, it 
ptimal tax that-the public investment alters citi- 


pteferences for future redistributive tax- 
he arguments here are relevant more 
uly when current policies alter citizens’ 
: ices Over future policies. Thus, they are 
ich that . ly to be relevant for small projects. In 
to apply them to public goods that gen- 
I ki consumption benefits, we would 
<A. z iite rather strong assumptions about pref- 

i nE S0- that- benefits could “alter citizens’ 


à a owever, since political equilib- 
> not to be unique in a citizen- 


candidate model, no efficiency proposition is 
available for the uniform benefit case. This is 
because citizens’ may anticipate a different fu- 
ture political equilibrium when an investment 
is undertaken. This may be true even if the . 
distribution of benefits implies: that a similar 
political equilibrium exists with the invest- 
ment. Condition (i) of Proposition 3 may then 
fail because of the way in which citizens an- 
ticipate future political outcomes to change. 
While Example 2 appears to be a more com- 
pelling instance of the failure of this condition 
(not resting on multiple political equilibria), 
it is clear that the prognosis for general effi- 
ciency results in dynamic models of political 
equilibrium is even poorer than the above ex- 
amples might suggest. 


IV. Discussion 


The results leave no doubt that public in- 
vestments satisfying standard criteria of effi- 
ciency will not necessarily be adopted in 
political equilibrium. However, in view of the 
efficiency result in Proposition 1, it is not clear 
whether these should be viewed as ‘‘political 
failures’? or whether technological definitions 
of efficiency should be questioned. 

Despite the widespread use of the term ‘‘po- 
litical failure’ in public choice writings, it is 
hard to find any precise definition of this idea. 
It seems desirable to have a definition that par- 
allels the usual definition of a market failure 
as closely as possible. Recall, therefore, the 
textbook analysis of market efficiency. First, 
the set of efficient allocations is characterized 
(graphically, the utility possibility frontier). 
This is a purely technological notion of effi- 
ciency, since the frontier depends only on the 
tastes and technologies of the economy. The 
second step requires a model, such as that de- 
veloped by Arrow-Debreu, to specify how 
markets allocate resources. The idea of market 
failure then comes from observing that, under 
certain conditions, markets do not result in al- 
locations that are on the frontier. The term 
‘‘failure’’ is justified by the observation that, 
in principle, all citizens could be made better 
off. 

To provide a parallel definition of political 
failure, one must similarly begin by defining 
the set of technologically feasible utility 
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allocations. This should reflect the available 
policy instruments. It is unreasonable, for ex- 
ample, to hold representative democracy to the 
Standards of what might be achieved were 
lump-sum taxes and transfers available, while 
restricting policy makers to using only distor- 
tionary or Uniform taxation.2' At the second 
stage, political institutions are modeled, with 
a focus on whether equilibrium policy choices 
result in utility allocations on the frontier. By 
analogy with markets, a political failure arises 
when equilibrium policy choices leave the 
possibility of feasible, Pareto-improving pol- 
icy choices. 

By this standard, the nonimplementation of 
potentially Pareto-improving public invest- 
ments described in the previous section rep- 
resents a political failure; if such an investment 
is not implemented then there exists a Pareto- 
improving policy sequence. There is no con- 
flict between this finding and Proposition 1, 
since that result says only that, conditional on 
second-period policies being selected through 
the democratic process, there exist no alter- 
native first-period choices which can raise the 
expected utilities of all citizens. Hence, if a 
potentially Pareto-improving public invest- 
ment 1s not undertaken, then it must be be- 
cause the resulting second-period equilibrium 
choices would have made some citizens worse 
off. Thus, making future policy choices 
through the democratic process constrains so- 
ciety to the interior of the set of technologi- 
cally feasible utility allocations. 

We anticipate two kinds of objection to our 
claims of political failure. The first accepts our 
definition, but disputes the way in which we 
have analyzed the issue. The second takes is- 
sue with the definition. In the first category, 
we include a Coasian-style challenge to our 
analysis, which argues that bargaining be- 
tween citizens and policy makers should be 
able. to eliminate the inefficiency. It is straight- 


cae It is common in the literature to overlook this point. 
Consider, for example, the textbook argument that repre- 
sentative democracy produces an inefficient level of public 
goods. The level of public goods demanded by the median 
voter is claimed to be ‘‘inefficient”’ because, in general, it 
does not satisfy the Samuelson rule. The latter is, however, 


derived under the assumption that lump-sum taxes and 
transfers are feasible. 
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forward to see that an incumbent policy 
would implement the investment in retumi 
future policy makers setting second-pgy, 
policies which would compensate them, Byig§ 
noring such gains from bargaining, our ang gee 
ysis might be regarded as misleading, jug 2 
Ronald Coase (1960) criticized the usual: 
proach to externalities in a market conte 
In response, we observe that the kind o 
gain needed to restore efficient. policy wo 
require a significant amount of commitng 
future policy makers would have to commiy 
the policies that they would select in the g 
ond period, and citizens would have to conim 
to elect these individuals in the future, Thy 
the Coasian bargain would have to invo 
significant fraction of the polity. It dogs 
seems unreasonable to presume that trang. 
tions costs would hinder the undertakin 
such bargains.” mo 
A further objection to the analysis is that&¥ 
political failures identified here could be eli 
inated if policy makers were elected for 
periods. (This is immediate from Propositi 
2.) However, our model provides no juki 
cation for holding a second-period elecit®: 
While technically correct, this critique ts 
the model’s structure too much at face v 
In reality, democracy is characterized by ff 
riodic, rather than one-time, elections and 
model captures this as simply as possibtg 
Given that it is a commonly observed feat 
of democratic government, it is reasonabltt 
suppose that periodic elections have a ra 
ale. This suggests that any benefits from® 
exceed the social cost from the policy d 
tions identified here. However, this is diff 
from arguing that the distortions that we! 
identified do not exist. They will arise 1” 
system with periodic elections where 8% 7 
ment policies have effects that last beyo di 
time of the next election. 
The main objection to our definition 
argument that it is insufficient to demot 
a political failure by showing that the 
some technologically feasible policy “ 





22 Indeed, one suspects that transactions cS 
main reason why societies resort to more deleg: 
of decision-making (see Dixit [1996] fot 
discussion). 



















































duce a Pareto-superior utility alloca- 


tead, it behooves the analyst to specify 
cond-p ternative institutional arrangement for 
them, g policy decisions which actually selects 


» ts misleading. Similar objections 
gaa of course, be made about the notion of a 
t failure. In particular, there is a need to 
‘onstrate that the government (or some 
; er institution for allocating resources ) 
commi auld do better (see Buchanan [1962] and 

‘iver Williamson [1994] for discussions 


.to co 
sct in the ig these lines). 
ave to co “Jä response, it is important to make clear 
future. at.we are not attempting to answer the 
> to invol  kroader question of whether representative de- 
y. It docs Mall’ ssaeracy is desirable in the class of institutions 
; that trag teat could be used to make collective choices 
ndertakin, although answering this would constitute an 
fe isteresting research agenda). Clearly, the abil- 
ysis is that ¥ to implement potentially Pareto-improving 
could be elma public investments represents only one di- 
lected for MÆ: seasion of this question. What is important is 
m Proposito reach a consistent position on markets and 


mae caveraments. Our definition is intended to 
achleve that. Just as the presence of market 


critique 1 Gilares does not imply abandonment of the 
ı at face VMB market system, the existence of political fail- 
terized IVM Eta does not mean that representative democ- 
sctions and should be forsaken.” Indeed, the revealed 
y as pos E peference for representative democracy, con- 
yserved fi Emed even more strongly in recent years, 


Saegests that the benefits of democratic policy- 


; have a rag Sing outweigh its costs. 

nefits from SEE- There are, however, still important reasons 
e policy 4 dk whether representative democracy re- 
this is di efficient policy choices. First, it is 
ns that w | to many issues in public economics. 
vill arise š absence of political failures, the pre- 
; where g puons of normative economics (particu- 
last beyo ose concerning efficiency y may be 


& as predictions about actual policy 


: the market-failure literature, it is all too common 
> policy ge conclusion that the existence of market failure 
; pk government could do better. This stems 

fader of viewing policy as being selected by 
er. Showing that a planner could do better 

Is really no advance over pointing out that 
Sen Bent ologically feasible, Pareto-superior al- 
see Buchanan (1972) for further elaboration of 


i policy choices. Absent this, the term _ 
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choices. If observed policy choices do not 
correspond to these predictions, then, instead 
of berating policy makers for their foolish- 
ness, it is more constructive to provide expla- 
nations for their choices (a conclusion 
reminiscent of Stigler, 1982). The second 
reason is more practical: developing an un- 
derstanding of why political failures occur 
may suggest efficiency-enhancing, piecemeal 
institutional reforms. 


V. Concluding Remarks 


The main achievements of this paper are 
twofold. First, it offers a precise definition of 
political failure. The notion put forward takes 
the standard technological definition of what 
is feasible for the citizens. This, we argued, is 
the most natural parallel to the literature on 
market failure. Second, the paper identifies 
three distinct classes of failures which arise 
when elections are repeated, even though cit- 
izens are forward looking and make optimiz- 
ing decisions. Potentially Pareto-improving 
public investments may not be introduced be- 
cause of fears that compensation needed to 
cover current costs will not be delivered and 
the fact that such projects may change the 
identities and/or policy choices of future lead- 
ers. These reasons can be traced to the fact that 
preferences over policy extend into the future, 
while political control does not. 

We acknowledged that a Coasian argument 
could be levied against our claims of political 
failure; bargaining between citizens over cur- 
rent and future policies could eliminate the in- 
efficiencies. However, this would require the 
ability to reach binding agreements about fu- 
ture policy, as well as a forum to bring the 
polity together. This vision is too utopian, 
given the transactions costs involved. More 
generally, however the issue of how institu- 
tions might evolve to minimize the economic 
costs associated with repeated elections is 
worthy of further study. ~ i. 

Much remains to be done to understand 
comprehensively the efficiency of resource al- 
location in a representative democracy, using 
the approach developed here and in our earlier 
paper. We have not yet modeled private-sector 
accumulation decisions and the effect 
of policy on these. Introducing imperfect 
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information also seems like an interesting av- 
enue for future exploration. Finally, it would 
‘be interesting to incorporate other political in- 
stitutions, such as pressure groups and legis- 
lative decision-making, into the analysis. 


AREENDIX 


va = 4 


PROOF OF PROPOSITION 2: 
We know that for some citizen j € N, 
(ETA, g*) solves the problem: 


ax V (G, Tı, 2), maY}, a@5(-), g)) 
s.t. (t, T, g) E Zi. 


Suppose that g* = 0. Then, since the invest- 
ment is potentially Pareto improving, we know 
that there exists a policy sequence {(ti, Ti, 
1), 73} such that V;((ti, Ti, 1), 72) > 
Vi (GY, TT, 0), m2(y2, @3(-), 0)) for all i € 
N. We will show that V,((t}, Ti, 1), m(¥3, 
æC), 1)) > VŽ, TT, 0), m(y2, @2(-), 
0)), which will contradict the hypothesis that 
= 0. 
By hypothesis, for all k € A(n(y2, 
a2(-))), 


(tall), Ta.(1)) = ((1), Tz(1)). 
Thus, 72(73, @3(-), 1) selects citizen j’s op- 
timal policy choice (#;(1), T2;(1)) with prob- 
ability one and hence 


Vi((tt, Ti, 1), mC, @3(-), 1)) 


= V(t, Ti, 1), 72) 
> VCC, TT, 0), m( 2, œC), 0)). 


PROOF OF PROPOSITION 3: 
We know that for some citizen j € N, 
(t*, T7, g*) solves the problem: 


Max V,Ch, T;, g), m( Y3, a3(-), g)) 


st. (4, 71,8) € Z. 


Suppose that g* = 0. Then, since {(t*, T*, 
g*), T3 } is a political equilibrium policy se- 


quence, Proposition 1 tells us that Alri) k 
Z3 (0). Thus, since the investment is x 

forwardly Pareto improving, we know’ 
there exists (ti, Ti, 1) € Z, such that ie 
dominates nx, Vi((t}, Ti, 1), 73) > 
Tı, 0) rž) for all i E N. We will sho 


VCC, Ti, 1), ma(Y} œC), 1)) 


> V;( (t7, TY, 0), m2( 72, @3(-), 0)) 


which will contradict the hypothesis that gt j 
0. To do this it suffices to demonstrate ii 
T2( Y2, @3(-),-1) dominates m2( Y3, a%(- 

Recalling the definition of dominang’ 
Section III, there are two conditions to chg 
First, let (44, T4) E Alma(Y}, @4(-), 9 
Then we know that there exists som 
A(ro(3, 0£3(+))) such that (a(1), Ta 
T2). By hypothesis, we have that} 
A(ro(y2, @2(-))) which implies that (t,() 
Tx(0)) E ACm2(¥2, @2(-), 0)). In addita $ 
again by hypothesis, (t2,(1), T2,(1)). 
U(ty(0), Tx(0)). This establishes the fs§ 


To check the second condition, note tif 
(h, T2) € A(m2(y2, $C), 0)) then 


m™(¥5, aS), 0) (h, T2) 


By hypothesis, therefore, we may write 


m(¥2; a3(-), 0) (t, T2) 


But, by hypothesis for all k € {i e NU 
(ta (0), Ta (0)) = (t, T2) } it must be th 


(ta(1), Tx(1)) € (h, Ta) 






















2, 


kE {i © NU (0):(19(0),72{0)) = (nT) >; 


ro(y2, @2(-)) (k). 


py 


kE {iE NU (0}:1y,(0),7240)) = (12,7! 


ro(-y2, œC )) (k). 






















t have ue 


ma( Y} a3(-), 0) (t3, T2 


py 


iS pe (ie NU {0} :(t2i(0).T2(0)) = (12,7) } 


(7s @(-)) (k). 


ne OF LEMMA 1: 
idea of the proof is to treat- the fraction 
dle class individuals in the population 
: eter and to show that as this fraction 
SOMES very small: (i) the gains for the poor 
g the investment go to zero while 
ş remain positive, and (ii) the costs for 
fanch from making the investment go to zero 
i chile the benefits remain positive. The result 
qi then follow from continuity. 
‘Define A = #Nj,/N. Given our earlier as- 
pions, #NpIN = ‘hy and #N,/N = 'h — 


jave that! ia 
ss that Er 
). In additi 
), TaD 
lishes the 


EN) = (ay + bg)? ]}, and y(t, g, 3 = 
(0)) = y A(g, X). Observe for future reference 
bet limo A(1, A) = lim-o A(0, X) = (az + 
ee M ‘lim-o te(1, A) = lim-o hp(0, X) = 
miles = a2)/2]/a3, and 


o, (ae + aż) /2 — (ay + 6)? 
ak tap—(ay+6)? J’ 


-a h e 


the fact that u(t, T, a) = T + he - 
tis straightforward to calculate pe- 
Rae utility for type J citizens 


p 
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al tom (8)*¥ (tom (8), 8, N) 
+ (1 — ty(g))?-a5/2] 
+ 31 trp(g)-¥(te(g); g, A) 
+ (1 ~ tae(g))?-a3/2]. 
Thus, letting, wse(8, A) = hx(8, A): ¥(hx(g), 


g, A) + (1 — blg, A))?-a5/2, for J € {P, 
R} and K E€ {P, M}, we know that a poor . 


‘policy maker will not undertake the invest- 


ment if 'A[wpp(1, X) + wey(1, )] is smaller 
than '/,[wpp(0, A) + wpy(O, X)], while a rich 
policy maker will undertake the investment if 
Pof wep(1, A) + waw(1, X)] is larger than 
“h[ wep(O, A) + wruC0, d)]. 

Note that for J€ {P, R} and KE {P, M} 


wx(0, A) — wix (1, A) 
= (1 — fx (0, )) [fox (0, 4A (0, d) 
+ (1 = hx(0,d))a5/2] 
= (1 = t(l, N) De, ACL, A) 
+ (1 — fx(1,))a3/2]. 
It is clear from our claims above that lim, 
Wyp(1,X) = limo wp (0, A), for JE {P,R}. 


However, 


tim Lwm(0, A) — wm(1l, A)] 


= fi tim #100, »)- fim ef (i v); 


where f(t) = (1 — t)[t(a} + a})/2 + (1 — 
t)a3/2] for J E€ {P, R}. Differentiating the 
function f,(t), we obtain fj(t) = (a2, — 
a?)/2 — ta?;, J € {P, R}, where -J = R 
when J = P and — J = P when J = R. Observe 
that f(t) > 0 for all t = (ab — a3)/2a}. 
Thus, since limy.9 tm(l, A) < limo tm(0, 


A) < (a2 — a2)/2a%, we have that lim-o .~ ., 


wWpu(1, A) < lim-o wpm(0, à). In addition, 


- we have that f(t) < 0 for all t which implies 


that lim,—=o Wry ( 1, `) > lim,9 Wp (0, `). By 
continuity, therefore, for sufficiently small A, 
'A[wpp(1, X) + wpy(1, A)] is smaller than 





"bl wpp(0, X) + wpu(0, d)], and 'A[war(1, 
A) + wrm( I, A)] exceeds ‘hE wrp(0, A) + 
wrm(O, d)]. 
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1 act is altruistic if it confers a benefit on 
one else while imposing a cost on its per- 
stator, How does costly altruistic behavior 
? Why doesn’t utility maximization in- 
eacaably eliminate such behavior? 

«One answer is immediately available: alleg- 
altruistic acts are not really altruistic. 
pan closer examination, they confer net ben- 

&ts rather than costs. For example, charitable 
dsaations may bring benefits such as public 
secognition or a warm glow that overwhelm 


pi ‘gost of the donation. If we push revealed 
op. 21-30 Exetefence theory to its logical limit, this con- 
Telfare, Mi stusion becomes as inescapable as it is tauto- 
of a Line gical. If someone commits an ‘‘altruistic’’ 


ñ this reveals that he prefers doing so.' 
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Many parallels between discussions of al- 
Sconomics and biology. Biologists frequently 

the full effects of actions on reproductive 
Recognized, seemingly altruistic acts are pri- 
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“We study a population of agents, each of whom, can be an Altruist or an Egoist. 
“ Altruism is a strictly dominated strategy. Agents choose their actions by imitating 
others who earn high payoffs. Interactions between agents are local, so that each 
agent affects (and is affected by) only his neighbors. Altruists can survive in such 
a world if they are grouped together, so that the benefits of altruism are enjoyed 

rimarily by other Altruists, who then earn relatively high payoffs and are imi- 
‘tated. Altruists continue to survive in the presence of mutations that continually 
introduce Egoists into the population. (JEL C70, C78) 


A second answer is also available: the in- 
teraction in which the altruistic act occurs 
may be repeated. If the interaction is infinitely 
repeated, then the folk theorem (Drew 
Fudenberg and Eric Maskin, 1986) ensures 
that there are equilibria in which Altruists 
survive, though there are also equilibria in 
which altruism does not appear.’ David M. 
Kreps et al. (1982) show that there are equi- ° 
libria in which Altruists survive in finitely re- 
peated games with incomplete information, 
though once again a folk theorem result ap- 
pears, including equilibria without altruism 
(Fudenberg and Maskin, 1986). 

We do not doubt that people often derive 
benefits from seemingly altruistic acts, and 
that many interactions are repeated. How- 
ever, we also believe that altruistic acts occur 
for which conventional models do not readily 
account. Embellishing the models to encom- 
pass such acts often leads to utility functions 
that are uncomfortably exotic or to an uncom- 
fortably strong faith in repetition. 

This paper provides an alternative model of 
altruistic. behavior with two key properties. 
First, we abandon the assumption that people 


vately optimal. For example, there are now numerous ex- 
planations for why it may be-individually beneficial for a 
bird to attract a predator’s attention by uttering a cry that 
alerts the bird’s flock to the predator’s presence. Richard 
Dawkins (1976) discusses bird alarms as well as other 
cases of apparent altruism. 

2 In biological contexts, explanations for altruism sim- 
ilar to those that arise in repeated games appear in the 
guise of reciprocal altruism (Robert L. Trivers, 1971). 
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are rational agents choosing utility- 
maximizing actions. Instead, we believe that 
people must learn which actions work well, 
and that an important force in learning is im- 
itation. Second, interactions between agents in 
our model are ‘‘local,’’ meaning that altruistic 
acts are more likely to affect nearby agents 
than more distant neighbors and that agents are 
more likely to imitate nearby than more distant 
neighbors. , 

To see how these forces can allow altruism 
to survive, suppose there are two kinds of 
agents: Altruists, who provide a public good 
to their neighbors at a cost to themselves, and 
Egoists, who do not do so. Suppose further 
that Altruists tend to exist in concentrated 
groups. Altruists can then earn higher payoffs 
than Egoists, because Altruists are more likely 
to enjoy the public goods provided by other 
Altruists. The imitation-based learning process 
now prompts other agents to become Altruists. 
In addition, nearby agents are the ones most 
likely to imitate the Altruists. This preserves 
the tendency of Altruists to clump together in 
groups and hence preserves the conditions 
needed for altruism to survive. 

This argument is unconvincing without 
some stability analysis. We expect perturba- 
tions to occasionally switch the behavior of 
some agents from Altruist to Egoist or Egoist 
to Altruist, perhaps because someone has an- 
alyzed the model and deduced that it is utility 
maximizing to be an Egoist, or has made a 
mistake, or has simply experimented with a 
new action. An Egoist thrust into the midst of 
Altruists will thrive on the public goods pro- 
vided by the latter and will be imitated, while 
an Altruist thrust in the midst of Egoists will 
fare poorly and will be ignored. Perturbations 
or ‘‘mutations’’ that occasionally cause people 
to switch strategies thus apparently produce a 
force pushing toward egoism. An explanation 
of altruistic behavior must demonstrate that al- 
truism can withstand such mutations.” 


* Group selection models share many of the features of 
our model of local interaction, but are now widely consid- 
ered to be implausible as explanations for altruistic -be- 
havior, largely because of their inability to withstand 
mutations. We discuss the relationship of our work to 
group selection arguments in Section V, 
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We find that only states composed prj 
of Altruists survive in the presence ofr 
tations. If a mutation introduces an Eppigc 
the midst of Altruists, then the Egoist wil i 
vive and spread. However, the resulting 
of Egoists quickly confronts limits on į 
ity to expand, as each expansion cay 
public goods supplied by neighboring A 
ists to be shared among more and more Ro 
and hence reduces Egoists’ payoffs. Ẹ 
are thus readily introduced but cannot exp 
beyond small, isolated groups. Isolated Als 
ists, in contrast, cannot even survive jn a 


casionally introduce a group of Altruists ig 3 
midst of Egoists. Such a group of Altruists¢g 3 
expand without bound. Mutations thus 
readily lead to large groups of Altruis 
Egoists, allowing the former to dominate, §# 

Altruism is not the only type of externs 
that can arise between agents. We extend! 
model to consider Hooligans, or agents y 
benefit from imposing damages on the 


or Egoists to survive in the midst of Hoa 
gans, though Hooligans will typically nol 
eliminated « entirely. The analysis is thenleg 
ther extended to general 2 xX 2 games, ale 
ing us to examine games with two strictÑ 
equilibria, one payoff dominant and on 
dominant. 
The work most closely related to oui 
cludes Theodore C. Bergstrom and Oded $8 
(1993), Lawrence E. Blume (1993) 
Glenn Ellison (1993), and papers by ! 
A. Nowak and Robert M. May (1992, 1%: 
and Nowak et al. (1994). Our spatial st ; i 
matches one of the models consideré 
Bergstrom and Stark as well as the sil? 
case considered by Ellison, while Blut 
Nowak and May examine spatial. move. 
which agents are arranged in a plane! 
than along a line. We differ from Elli 
Blume in taking imitation, rather th 
variant of best-reply dynamics, to be 
ing force behind strategy selections. +4 
crucial, as altruism has no hope in a W®. 
best responders. We differ from Nowak 
May in relying on analytical techniqu®, 
than simulations, albeit for a very $& 



































MARCH 33 NO. ! 

ed prim a and especially in studying the effect of 
20fr fs oe ons. 

an Egor. e.-tion I presents the model. Section II ex- 
oist w tnes equilibria of the imitation process. 
ulting saun are many possible limiting outcomes, 


” eding upon the initial conditions of the 
“4 apade Past comprise a significant 
oring AREE < ion of the population in all but one of 
30 Oe We establish conditions under which the 
offs. Egonak: probability of an initial condition leading to 

‘elimination of altruism shrinks to zero as 


olated Alii se population grows large. Section III shows 
irvive init t only those limiting outcomes with a sig- 
tions will¢ gificant proportion of Altruists survive muta- 
ltruists j fons, Section IV pursues a generalization of 
' Altruis the model in which agents interact in larger 
ns thus seighborhoods. In doing so, we find that it can 
A Itruists be to Altruists’ advantage to have a relatively 


high cost of altruism. A higher cost of altruism 
easures that if Altruists survive, then they must 
i do so in larger groups, because only then do 


r agents WWM they share enough of the public good to com- 
ges on thi@e. sensate for the high cost of being an Altruist. 

allow Alli This in turn ensures that if there are any Al- 
f Egoist “tulsts at all, then there are relatively large 


; oups of Altruists. Section V concludes. Un- 
cea kss otherwise noted, proofs are contained in 
pically ntb: the Appendix. 


uM 
z 
oO 
m 
È 


I. Altruists and Egoists 


We consider a collection of N individuals, 
Where N is finite. Each individual can be either 
4,8 Altruist or an Egoist. An Altruist provides a 
Hblic good that contributes one unit of utility 
those who receive its benefits. The net cost 
iers by MAMES: © the Altruist of providing the public good is 


(1992, 1% 0, so that the combination of enjoying the 
patial strua benefits of his own public good and bearing the 
onsider! of its provision reduces the Altruist’s util- 


C. Egoists provide no public goods and 


ile Blume Ro costs. Instead they simply enjoy the 
tial. mod befits of the public goods provided by others. 
a plane Boa gum Is divided into discrete periods. At the 
ym Ellis = Of each period, after consuming any public 


that is available and bearing provision 

t an Altruist), each agent decides, ac- 
E | RA ‘to a learning rule, whether to be an 

einaW or Egoist in the next period. 

om Nowo Os fature of this learning rule is important. 
bility is that the agents are fully ra- 

though myopic, and the learning rule 
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leads them to adopt single-period, expected- 
utility-maximizing actions. In this case, they 
will realize they face a variant of the prisoner’s 
dilemma and will play the strictly dominant 
strategy, namely Egoist. Instead of choosing 


` best replies, however, our players imitate the 


strategies of others whom they observe to be 
earning high payoffs. 

At one level, this imitation seems prepos- 
terous. How hard can it be to figure out that 
being an Egoist (or defecting in a prisoner’s 
dilemma) is a strictly dominant strategy? This 
is indeed a trivial task for a game theorist fac- 
ing the sterilized 2 xX 2 games with which we 
often work. However, these games are a sim- 
plified representation of a much more compli- 
cated reality. The agents who actually play the 
game may not recognize that they are playing 
a game, may not know who their opponents 
are, may not know what strategies are avail- 
able, and may not know what payoffs these 
strategies bring. They may then be unable to 
think like game theorists, or like the agents in 
game-theoretic models. At the same time, we 
believe that people are generally able to form 
a good estimate of others’ payoffs, whether 
these payoffs are measured in terms of money 
or other units such as social status or prestige, 
and that people tend to imitate the behavior of 
those they observe earning high payoffs. 

Imitation alone appears to hold out no hope - 
for the survival of altruism. Egoists will enjoy 


‘the same public goods as Altruists, while only 


the latter bear costs. As a result, all Egoists 
will earn higher payoffs than all Altruists and 
imitation can only lead players to become 
Egoists. 

This argument is applicable only if the ben- 
efits of the public good provided by each Al- 
truist extend to every agent in the population. 
The prospects for Altruists improve if the pub- 
lic good is a local public good. To make this 
precise, we introduce’a neighborhood structure 
taken from Bergstrom and Stark (1993) and 
Ellison (1993). Agents in the model are lo- 
cated around a circle.* Each agent interacts 


4 A spatial interpretation is convenient, but local inter- 
action structures may arise in other ways. In academia, 
field of specialization is probably more important than lo- 
cation in determining patterns of interaction. 
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with his two immediate neighbors, i.e., with 
one agent to his right and one to his left. If an 
agent is an Altruist, then his immediate neigh- 
bors enjoy the benefit of his public good pro- 
vision. The payoff of agent i is then given by 
N4 — C if i is an Altruist and N¢ if i is an 
Egoist, where Nf € {0, 1, 2} is the number 
of i’s Altruist neighbors (excluding himself). 

In each period, each agent takes a draw from 
an independent Bernoulli trial, causing the 
agent to “‘learn’’ with probability u € (0, 1] 
and to retain her strategy with probability 1 — 
p. An agent who learns observes her own pay- 
off and the payoff and strategy of each agent 
in her neighborhood. She then chooses to be 
an Egoist if the average payoff of the Egoists 
in her sample exceeds that of Altruists, and 
chooses to be an Altruist if the average payoff 
of Altruists exceeds that of Egoists.° If an 
agent and her two neighbors all play the same 
strategy, be it Altruist or Egoist, then the agent 
will continue to play that strategy. 

We shall concentrate on the case of u = 1, 
so that every agent learns in every period. This 


gives a deterministic learning process which ` 


simplifies the derivation and statement of the 
results. However, Bernardo A. Huberman and 
Natalie S. Glance (1993) have recently argued 
that the outcomes of local interaction models 
can be sensitive to whether all agents adjust 
their strategies at the same time. We accord- 
ingly comment on how each of our results 
would be modified if u < 1. 

A state is a specification of which agents are 
Altruists and which are Egoists. Let S be the 
set of possible states. For states i and j in S, 
let Py be the probability that a single iteration 
of the imitation process changes the system to 
the state j given that the current state is i. 
Since the learning process is deterministic 
(with u = 1), P; is either 0 or 1. The collection 


5 We simplify the analysis by choosing the cost C so 
that payoff ties do not arise. There are many other plau- 
sible learning rules. For example, an agent may simply 
compare the best Egoist and best Altruist payoff among 
those payoffs she observes, or may compare the sum of 
the Egoist and Altruist payoffs, rather than considering 
averages. Itzhak Gilboa and David Schmeidler (1995, 
1996), in the context of their case-based decision theory, 
examine the difference between considering the sums or 
the averages of payoffs. 




























{Pi}: jes, along with a specification of 
initial state at time zero, is a Markoy 
on the state space S. We refer to this 
process as the ‘‘imitation dynamics.” 


Il. Equilibrium 


A. Absorbing Sets 


We are interested in the stationary digs 
butions of the imitation dynamics. We say 
a set of states is absorbing if it is a mining! 
set of states with the property that the Mari $ 
process can lead into this set but not ov 
An absorbing set may contain only one g 
say i, in which case P; = 1 and iis a station; 
state of the Markov process. An absorbing 
may contain more than one state, in whic = 
P; = 0 if i is contained in the absorbing: 
and j is not, while the Markov process cy} 
between states in the absorbing set. 

For each absorbing set of the Markov p! 
cess, there is a unique stationary distribute 
the support of which consists of that absorbiy 
set. We can then learn much about the statis 
ary distribution of the learning process 
studying absorbing sets. 

We begin by compiling a description of & 
imitation dynamics. We assume C < ‘h' A 
the end of each period, an agent may eit? 
retain her strategy or choose a strategy play 
by one of the two agents closest to her, 
pending upon their payoffs. These payoffs 
turn depend on the strategies of the next 
neighbors. The fate of an individual is 
completely determined by the strategies of 
four nearest neighbors. Phy 

An Egoist who learns by imitating #% 
neighbors can become an Altruist only if 
least one of his two nearest neighbors 8" gs 
Altruist. However, if both of his immedi, 5 
neighbors are Altruists, then the Egoist e% 


€ We can always vanquish altruistic behavior b 
ing it too expensive. If C > '/, we find that 
absorbing sets of the learning process are those CoP i 
of single states in which either all agents are Alt ‘be ete tO Te: 
all are Egoists. The former contains only itself in 6%; s 
of attraction, while the latter attracts the remain 
state space. In the presence of mutations, only 
absorbing state survives. The case of C = '/, lea? 
convenient payoff ties. 3 































sf of two, more than an Altruist can ever 
‘causing the Egoist to retain his strategy. 
Egoist can therefore become an Altruist 
exactly one of his neighbors is an Al- 
"2, %aith the Altruist neighbor earning a 
aa yoff-than the average payoffs of the 
and his Egoist neighbor. This payoff 
ty holds only if the Altruist has an Al- 
st'neighbor, since otherwise the Altruist re- 
“the lowest possible payoff of —C, and 
other Egoist in the neighborhood faces 
ighborhood containing only Egoists, so as 
g'the average Egoist payoff below 1 — 
Hence, an Egoist can become an Altruist 
f he faces either the following combi- 
son of strategies or its mirror image, where 
1 represents an Altruist and “E” an 


y distrib 
hat absorbis 
ut the statiog similar calculation shows that an Altruist 


remain an Altruist if and only if one of 


xa a ax 


aa a EE 


on and where an ‘‘x’’ holds the place of 
ragent who may be either an Altruist or an 


In all other cases, Altruists change to 


BS 
e> Goaditions (1) -—(2) provide a complete de- 
<Puon of the individual imitation dynamics. 
Sillustrate some absorbing sets, we represent 
=dgents as being located on a line, where 
think of the ends of the line as being joined 


>. wip e~ 


find that J 
we those cok 


to use “A” We will also often separate 
whom We are interested by spaces, as in the case 
ual Egoist here, though these spaces have no 
than directing attention to particular 
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to form a circle. From ( 1)-— (2), we easily ver- 
ify that the following are absorbing sets: 


e The state in which all are Altruists. 

e The state in which all are Egoists. 

e A state in which all are Altruists except two 
adjacent Egoists: mie 


-+ + qaaaaaaaEEaaaaaaaa:::. 
e A set of two states, consisting of: 


-+ + aaaaaaaaaEaaaaaaaaa: 


-- + qaaaaaaaEEEaaaaaaaa::: . 


In this last case, the imitation dynamics cycle 
between the two states in the absorbing set. 
The lone Egoist initially earns the highest pos- 
sible payoff of 2, inducing his two neighbors 
to become Egoists and leading to the second 
state in the cycle. Each of these new Egoists 
finds himself in the situation described by (1), 
where he has two Egoists on one side and two 
Altruists on the other. This causes the new 
Egoists to switch back to altruism, beginning 
the cycle anew. We refer to such a cycle as a 
blinker. 

The two outside agents in the blinker face a 
coordination problem. It is an equilibrium for 
one but not for both to be an Egoist, and the 
learning scheme causes them to cycle around 
this equilibrium. We suspect that cycles in be- 
havior do occur, though our simple model cap- 
tures these cycles in a crude way. The presence 
of blinkers is a product of setting u = 1, forc- 
ing all agents to assess their strategies in every 
period. If u < 1, then blinkers are no longer 
absorbing sets, since a period will eventually 
arise in which only one of the two outside 
agents in the blinker revises her strategy, lead- 
ing to a pair of adjacent Egoists. All absorbing 
sets would then be singletons. 

These examples, and combinations con- 
structed from them, include all of the possi- 
bilities for absorbing sets. Some terms will be 
useful in making this precise. If agents a and 
B play the same strategy, either Altruist or 
Egoist, and if all agents between @ and £ play 
this strategy, then we will refer to agents a, A, 
and the intermediate agents as an interval of 


od 





ery 
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z either Altruists or Egoists. We call a maximal 
= such interval a string. Notice that strings may 


be of any length from 1 to N, the length of the 
circle. We then have (the proof is in the 
Appendix) :° | 


PROPOSITION, 1: Let0 < C < 'h and p = 
1. Then: 

(1.1) Absorbing sets consist of (i) the state 
in which all agents are Egoists, (ii) the state 
in which all agents are Altruists, and (iii) sets 
containing states in each of which Altruist 
strings of length three or longer are separated 
by Egoist strings of length less than four. 
These sets are either singletons (in which case 
all Egoist strings are of length two) or contain 
two states [in which case any string of length 
one (three) in one of the states blinks to a 


string of length three (one) in the other]. 


(1.2) Except for the state in which all 
agents are Egoists, the proportion of Altruists 
in an absorbing state, or the average propor- 
tion over the two States in an absorbing set, is 
at least 0.6. 


Proposition 1 indicates that there are many 
absorbing sets, each of which is the support of 
a stationary distribution of the imitation pro- 
cess. In all but one of these absorbing sets, the 
majority of the population is Altruists. Hence, 
there is no possibility for moderation in altru- 
ism. If Altruists survive at all, they must be the 
majority. 

To see what lies behind this result, we first 
note that a string of Egoists in an absorbing 
set can never be longer than three. If the length 
of an Egoist string exceeds three, then the two 
Egoists at its edges will each have two Egoists 
on one side and two Altruists on the other, and 
hence they will become Altruists [cf. (1)], 
causing the string to shrink. Egoists can thus 
survive only in strings of length two or strings 
of length one (where the latter alternate with 
strings of length three in a blinker). Altruist 
strings must be at least length three in order to 
survive, and surviving Altruist strings can ex- 
pand, since doing so creates more and more 
high-payoff Altruists. This allows us to 


2 If u < 1, then this proposition continues to hold, ex- 
cept that blinkers are no longer absorbing sets. 


- three at some point, after which it disap 


























conclude that if there are any Altruists's 
then Altruists will occur in strings of Je 
least three while Egoists occur in stringi 

most two (or in blinkers the average a. : 
which is two), and hence there will be ati 
60-percent Altruists. 


ne: tenn 


B. Basins of Attraction 


Because the state in which all agents 
Egoists is absorbing, the system may dri 
truists to extinction. To assess the likelj 
of such an event, we identify the initial; 
ditions from which the system converges {g 
absorbing set containing Altruists. 

The proof of Proposition 1 shows that 


3 


string of Altruists either drops below le 


or persists forever. We refer to a string of 
truists whose fate is the latter as a ‘‘persisté 
string. The system will converge to a sta 
which at least 60 percent of the agents are 
truists if and only if the initial condition o 
tains at least one persistent string. 

The following proposition first characte 
persistent strings. We then suppose 
agents’ initial identities as either Altruisté, 
Egoists are randomly determined, and in 
tigate the probability. that this leads to an ini 
state containing a persistent string of Altruse 
We have:® 


PROPOSITION 2: Let 0 < C < ‘hand 
1. Then: 
(2.1) A string of Altruists is persiste gy 
and only if (i) the string contains at least 
Altruists, (ii) the string consists of four AK 
ists bordered on at least one end by two bf 
ists, or (iii) the string consists of 
Altruists bordered on each end by two Egoz 
or bordered on at least one end by thre 
ists. All other strings of Altruists are dl 
nated by period three. 
(2.2) If agents’ initial identities as 
ists or Egoists are determined by indepem 
identically distributed random variables 


9 This result holds if » < 1, with the modificati™ 
some strings of two Altruists, as well as strings of 
aEaaaEFE, survive with a probability greater 
but less than one. 
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truis ive probability on Altruist, then as N 
4 =e. the probability of a persistent string 
ist in the initial state, and hence con- 
‘to an absorbing set containing at 
60-percent Altruists, approaches unity.. 


i randomly determined initial condi- 
Ta a large population, the probability 
‘altruists survive is high because there will 


Il age st certainly be an initial group of Altruists 
nay driv eenough to ensure their survival, and 
ne likeli to ensure that most agents are eventually 
e initial’ , However, a great deal of growth may 


ired before a single group of Altruists 


nverges {ig be req 
s. HA zaa comprise an appreciable fraction of a large 
ows that? ‘apalation. How long must we wait before 
below len “genet agents are Altruists? l 

it disap s riod three, any string of Altruists that 
, string is not persistent will have been eliminated, and 


‘the population will consist of persistent strings 
of Altrists separated by strings of Egoists. If 
‘a string of Egoists is not already of length two 
a a blinker, then it will contract at a rate of 
fso agents per period, as the Egoists on the 
two ends of the’string switch to altruism. We 


fi 
agents area 
ondition cag 
g. E ae 


: charactena p, KEN, 

supposé ay accordingly pose our waiting-time ques- 
or Altmih A oo as the following: how long do we expect 
sd, and im walt until a string of agents lying between 


wo; persistent Altruist strings has been re- 
sed to length two or to a blinker? But since 
ting time is half of the string’s length, 
ssibly three periods, we can equiva- 
k how long a string of agents we ex- 
find between two persistent strings of 

There is no reason why strings of 
ying between surviving strings of Al- 
ould be longer in larger populations, 


; of fou k 10 
id by twa T ce no reason for expected waiting 
sists 0 to increase as N increases. However, the 
by two Él èd length of such a string is quite sen- 
I by three ; the probability that an agent in the 
ists are Condition is an Altruist. We can 


tities as* 
ry indepell ro . o . > e a 

? viable „agents’ identities in the initial con- 
i andomly and independently deter- 


in probability p attached to being an 


then agents learn less frequently, and ex- 
es will be longer. 


“oi OPOSITION 3: Let 0< C< ‘hand r = 


Altruist. Then, in the limit as the population 
gets arbitrarily large, an upper bound on the 
expected length of a string of agents between 
two persistent Altruist strings is given by: 


we EESE 


Pp Length 
0.5 22 

0.4 34 

0.3 63 

0.2 171 

0.1 1140 
0.05 8480 
0.01 1.01 x 10° 
0.001 1.00 x 10°. 


Expected waiting times are thus relatively 
moderate as long as there is a sufficiently high 
initial probability that a randomly selected 
agent is an Altruist. For example, a probability 
of altruism of 10 percent gives an upper bound 
of 573 on the expected waiting time (half of 
the Egoist string’s expected length, plus 
three). On the other hand, persistent strings 
will be extremely rare if Altruists are very rare, 
and expected waiting times will be very long."' 


C. Hooligans 

Altruism, conferring a benefit on someone 
else at a cost to oneself, is not the only way 
that one agent’s actions may affect another. At 
the opposite extreme we have Hooligans, who 
benefit by imposing harm on others. Notice 
that hooliganism need not be limited to the 
psychopathic. Those who litter in order to 
avoid the cost of disposing of their refuse, 


`U We expect to wait longer until Altruists dominate the 
population when Altruists are rare, but the resulting ab- 
sorbing states are likely to have higher proportions-of Al- 
truists. This result holds because such initial conditions 
will be characterized by relatively small numbers of long 
strings.of Egoists, who will be transformed into small 
numbers of short strings of Egoists. When Altruists are 
more likely, there will be many short but distinct strings 
of Egoists in the initial condition, leading to an absorbing 
state with more strings of Egoists. 
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FIGURE 1. 2 X 2 GAME 


those who pollute rather than take costly 
abatement measures, and those who shirk in 
group efforts are all Hooligans. 

Our model is easily generalized to accom- 
modate Hooligans. Let there be two types of 
agents; denoted by 1 and 2. Let type 1 con- 
tribute K, to the payoff of each of his neigh- 
bors at a cost of C, to himself. Let type 2 
contribute K, to each neighbor at a cost C, to 
himself. There is no loss of generality in as- 
suming that K, > K,. Our model of Altruists 
and Egoists is then the special case in which 
K, = 1,C, = C, and K, = C, = 0. The behavior 
of the model depends only on a single 
parameter: ' 


PROPOSITION 4: Let K, > K,. Then any 
variation in the values of K,, K), C,;, and C, 
that preserves 


C-C 
3 ee 
(3) KK, 


gives rise to the same imitation dynamics. 


Hence, any two specifications of the payoffs 
that preserve (C, — C,)/(K; — K2) give rise 
to the same absorbing sets, basins of attraction, 
and dynamic paths for the imitation dynamics, 
and the same limiting distributions in the pres- 
ence of mutations. 

For the Altruist and Egoist model of the pre- 
_ vious sections, the ratio (3) was C, which was 
“interpreted as the cost of altruism. Consider 


12 The proof establishes this proposition for a wide 
class of imitation dynamics including those of the current 
section as a special case. 

























MARCH 
the following pairs of types of players, 4 
case, the first column identifies the effe; cine 
agent of type 1 has on his two neighbo f te 

the cost to the agent of that effect, whi See 


second column provides analogous inf, 
tion for an agent of type 2. 


-f Kie K, , 
C, C2 


and Egoist pair from previous sections; KEYS W 
second pair of agents consists of an Egoista% 
a Hooligan who enjoys (incurs a negative ce BY iaa 
from) causing damage of one unit to his nig P 
bors. In case this Hooligan seems too mè Era 
cious in his enjoyment of the harm he cas 
the third pair rewrites this situation as anga Raa 
of type 1 who imposes no harm on others $ 
incurs a cost of C to avoid doing so, witiRgg: 
type-2 agent who does not incur the cost $ 
imposes damage of one unit on his neigh. 
The fourth pair includes an Altruist and a Hë 
ligan. The last pair has two Hooligans, omé 
whom causes twice the damage and dove 
benefits from doing so. In each of these 
ifications, the ratio (Ci — C,)/(Ki - 
given by C, and hence these are equiv 
models. As long as C < 'h, it is alway! 
first type in each pair that will come to 
prise the majority of a large population 
randomly determined initial condition. 
gans will then be in the minority when BS 
Egoists or Altruists, though some Hool g 
will survive, just as some Egoists sures 
when paired against Altruists.'* 

Similar insights can be used to exten! 
analysis to general 2 X 2 symmetric 
Suppose each agent must choose a sing 
egy°to use when playing- the game sh 
Figure 1 with each of his neighbors. W# 


' The final specification shows that Hooliga®s 
in the majority if paired with even worse Hooli 
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ers. Ina enerality, we can assume a > d. 
he effec! gill then further concentrate on the case 
ighborg'=s we hiċh a > b. This latter assumption ex- 
ct, whig | Bes: “some games but retains the common 


ous informe ee ‘tes of 2 X 2 games. Then an argument 
oa ~ous to the proof of Proposition 4 shows 
imitation dynamics depends only upon 








two numbers 
c-b _d-b 

0 paS a-b : a-—b- 
g slight of this, we can transform the payoffs 

Figure 1 by subtracting b from each payoff 
) dividing by a — b to obtain the equivalent 
; tion of the game given in Figure 2, 

(c — b)/(a — b) and £ = (d — b)/ 

niliar A (a =b). 
sections. - -We can now classify games according to the 
an Egoist age yalaés of a and 8, where 8 < 1 (because we 
ı negative G Rave assumed a > d). We have: 
it to his neig 4 
2ms toom ‘Prisoner's Dilemma: 0 < 6 < 1, l<a. 
arm he cay Coordination Game:0< 8 <1, a<l. 
ion as anapi # Chicken: 6 < 0, 1<a. 
1 on other cient Dominant Strategy: 6 < 0, 
ing so, WR a <1, 
ar the cost; ae 
his neigh classification is illustrated in Figure 3. 
uist and a$ “efficient dominant strategy’’ game is 
oligans, ‘Ong In which X is a strictly dominant strategy 


q 


ge and dou sad the outcome (X, X) is efficient, unlike 


i of thes joner’s dilemma. A coordination 
(Ky - ; as two strict Nash equilibria, given 

are equiv: X; X) and (Y, Y). Chicken has one 
t is always Strategy Nash equilibrium and two 
| come to é etric pure strategy equilibria.'* In the 





a coordination game, (X, X) is the 
eae dominant equilibrium (because a > 
ity when $ aid’hence B < 1), and is also risk domi- 


some Hoo 2 + B < 1, while the equilibrium (Y, 
igoists $ tisk dominant if æ + 8 > 1. The inter- 
i hich a = 1 + '4C, 2 = '4C, and 


dtoe em shown in Figure 3, describes the range 


ct ons A and B of this section. 


eid) Metric pure strategy equilibria of this sym- 
Ty me relevant if agents can condition their 
sorse Hoolig Some asymmetry, such as location. 


i 


st and Egoist games that was analyzed _ 


FIGURE 2. TRANSFORMATION OF GAME IN FIGURE | 


The methods developed in the previous sec- 
tions to examine Altruists and Egoists can be 
applied to any other game in this classification. 
For example, consider coordination games. 
Let a + 8 > 1 so that (X, X) is the payoff- 
dominant equilibrium but (Y, Y) is the risk- 
dominant equilibrium. Now consider a 
boundary between a group of agents playing 
strategy X and a group playing strategy Y, or’ 


The only agents at risk of changing their strat- 
egies are the two agents, one playing X and 
one playing Y, at the ends of their respective 
strings. Each faces a neighborhood with one X 
and one Y agent, in addition to themselves. In 
Ellison’s (1993) model, each chooses a best 
response to his two neighbors. By assumption, 
Y is risk dominant and hence is a best reply 
when one neighbor plays X and one plays Y. 
Hence, the agent playing Y retains his strategy 
while the agent playing X switches to Y. The 
string of Y’s thus grows while the string of 
X’s shrinks, ensuring that best-reply learning 
leads to the selection of the risk-dominant 
equilibrium. 

In our imitation model, the X player on the 
boundary earns a payoff of 1, while the adja- 
cent X player earns 2. The Y player on the 
boundary earns a +’ while the adjacent Y 
player earns 26. Comparing the average pay- 
offs, we find that the boundary player Y retaiis 
his strategy if a + 38 > 2, while a boundary 


15 As with Altruists and Egoists, we find the displays 
easier to read if we use a lower case x to represent the 
strategy X. 
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FIGURE 3. CLASSIFICATION OF GAMES 


X will turn into Y if a + B > *4. Hence, there 
are three subregions of (œ, 8) values within 
the region of coordination games in which risk 
dominance and payoff dominance conflict. In 
the first region, a + 32 > 2 and a + £ > °h, 
and hence both players will play Y in the fol- 
lowing period. In the second region, œ + 
38 <2 and a + B < *4, and hence both will 
play X in the following period. In the third 
region, a + 38 > 2 anda + £ < °h, and both 
will retain their strategy. In the first region, the 
string playing the risk-dominant action will 
grow, in the second it will shrink while the 
string playing the payoff-dominant action 
grows, and in the third region each string 
maintains its length. 

If a string of agents playing the risk- 
dominant action Y is to expand, its payoffs 
must provide a premium over that required for 
risk dominance (i.e, a + B > °). This is 
necessary because an agent at the end of a 
string of X agents compares not whether X or 
Y is a best reply, but whether the X or Y players 
in his neighborhood are earning higher aver- 
age payoffs. One of the X players in his neigh- 
borhood is bordered by two other X players, 
and hence receives an exceptionally high pay- 
off. Risk dominance alone is not enough to 
overcome this payoff. 

If a string of agents playing the payoff- 
dominant action X is to expand (while Y is risk 
dominant), then we must have a + 38 < 2 
and a + 2 < *4. In conjunction with the re- 
quirement that a + 6 > 1, these inequalities 
require a > p. Hence, the payoff to playing 
strategy Y must be greatest if the opponent 
































plays X, even though (Y, Y ) is an equ 
This occurs because the neighborhood 
agent who plays Y, and who is located Ong 
end of a string of Y agents, contains a Yp ie 
who faces two Y opponents and hence eyga 
relatively high payoff. The average payas 
X can be highest only if the payoff in 
rium (Y, Y) is relatively small. This js į 
compatible with risk dominance op} 
a> Bp. 

Given that strategy Y must receive a p $ 
mium over risk dominance in order to expag iÈ 
and given that strategy X can expand ini 
absence of this premium only if the addi 
condition a > £ holds, then it is no sure 
that there are some cases in which nejt 
string will expand. Imitation can then 
peaceful coexistence of the two strategies’, 
like best-response behavior. Imitation 
the coordination failures created by cony 
tence to persist because agents on the boug Sime gutsjsmne 
ary of a string, and hence experienciyiemgainbutic 
coordination failures, are most likely to iè SZASUEEN:.: 
serve other agents with the same strateji RE PROPOS] 
who are not facing coordination failures S BEA ihen: 1 
to observe agents with the other strategy wk 
are plagued by such failures. This introdu] 
a force against changing strategies, and builg 
sufficient inertia into the system to supë 
coexistence. i 


Il. Mutations 


of each period, and after imitation has 0% 
curred, each agent takes a draw from ale Sits 
dependent, identically distributed Berno ia 

random variable. With probability A, this% 
is a mutant and changes his type, either 1 
Altruist to Egoist or from Egoist to AIZ $g 
With probability 1 — \, this agent experia” $ 
no mutation. We will be interested in the 5 
in which à is small, so that imitation 8” 
primary force driving strategy revisions 
study this by examining the limiting 


7 re 163. 
the mutation probability \ goes to zero. + 


be 


16 In economics, the common practice is to fol 
lead of Michihiro Kandori et al. (1993) and H.: 
















































l ` þe the probability that the combina- 
choad E :¢ imitation and mutation changes the 
4 ‘given that the current state is i." Then 
veg iS again a Markov process on the 
rence Sa: ‘space S, which we refer to as the 

i Sal tation-and-mutation dynamics.” Notice 
T, > 0 for all i and j, which is to say that 


eceive ‘ain 

ee a has a unique stationary distribution. The 
>xpand m ions of states reached along any sample 
the additi pub approach this distribution almost surely, 
is no sian sd.the distribution of states at time f ap- 
which nig this distribution as ¢ gets large. (John 
an then’ meny and J. Laurie Snell [1960 Theo- 


gM’ penis 4.1.4, 4.1.6, and 4.2.1.) 
í r We study the limit of these stationary dis- 


tation 4 

ad by coe wibutions as the probability of a mutation » 
on the bo -geta small, which we refer to as the limiting 
experi idfribution 


ROPOSITION 5: Let 0 < C < 'h. IfN > 
then the limiting distribution places posi- 
vë probability only on states contained in ab- 


r strategy y on Ste 
Chis introk Se/BINg sets of the imitation process in which 
ries, and b M. 82 proportion of Altruists is at least 0.6. 

em to Si 


techniques involved in establishing this 
‘which holds for u € (0, 1], were de- 
ved by M. I. Freidlin and A. D. Wentzell 
4384) and were introduced into economics 
Hy Kandori et al. (1993) and Young (1993). 
fgument begins by observing that when 
® S€ Mutation rate is small, the system spends 
‘all of its time in absorbing sets of the 
dynamics, and hence the limiting dis- 
allocates all of its probability to such 
vements between absorbing sets of the 
t dynamics can be accomplished only 


wes in them 
; that at the; m 


oist to AN i 
sent expel o Rons. The system will allocate most of 
asted in Utes ility to absorbing sets of the imita- 
2 n ug 

imitation 3 

y revisi 


iology, the concept of an evolutionarily 
“(Jobn Maynard Smith [1982]) is built 
Presumption that mutations are arbitrarily im- 
Foupared to the forces of selection. 
probability that mutations change the state 
current state is i, then Ty = ,PxQ,;. 


) in concentrating on arbitrarily small mu- 
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tion process that are easy to reach, in the sense 
that it requires relatively few mutations to 
reach their basin of attraction from other ab- 
sorbing sets. The proof involves showing that 
as long as the population is sufficiently large, 
it is much easier for mutations to introduce 
Altruists into a world of Egoists than for mu- 
tations to eradicate Altruists from a purely al- 
truistic or mixed world. 

One’s initial impression might be. that muta- — 
tions should be inimical to Altruists, because a 
mutant Egoist will thrive and grow when intro- 
duced into a collection of Altruists while a lone 
Altruist will wither and die when introduced into 
a collection of Egoists. Notice, however, that a 
small clump of Altruists in the midst of Egoists 
will not only survive, but will grow. It takes only 
three adjacent Altruists in a world that is other- 
wise completely Egoists to ensure that the imi- 
tation dynamics lead to an absorbing set 
containing at least 60-percent Altruists. 

It takes only a single mutation to introduce 
an Egoist into a world of Altruists. However, 
the resulting Egoist string can grow no longer 
than three. In light of this, consider an initial 
state that consists only of Altruists. A mutation 
creating an Egoist or a clump of Egoists will 
prompt imitation dynamics leading to an ab- 
sorbing state with no more than three Egoists. 
To get additional Egoists, additional mutations 
are required. These mutations can lead to 
states where there are many small clumps of 
Egoists. As these clumps become more nu- 
merous, and hence closer together, additional 
mutations join together previously separated 
clumps of Egoists. But as long as there are still 


‘some strings of Altruists, these newly joined 


strings of Egoists will shrink, replacing two 
original strings with a new, shorter string (of 
length three or less) and ultimately decreasing 
the proportion of Egoists. In order to further 
increase the proportion of Egoists, mutations 
must simultaneously eliminate all strings of 


. Altruists, But this requires a large number of 


mutations, if N is large, and hence is extraor- 
dinarily unlikely. If N exceeds 30, then it takes 
at least four mutations to eliminate all Altru- 
ists, which suffices for the result. 

Altruists can thus invade a world of Egoists 
with only a local burst of mutation that creates 
a small string of Altruists, which will then 
subsequently grow to a large number of 
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Altruists. Mutations can create small pockets 
of egoism, but these pockets destroy one another 
if they are placed too close together, placing 
an upper bound on the number of Egoists that 
can appear. The only possibility for surpassing 
this bound lies in a ‘‘global’’ mutation com- 


bination that simultaneously attacks all strings.. 


of Altruists. Mutations thus lead much more 
readily to absorbing sets with Altruists than 
absorbing sets without them, and the limiting 
distribution concentrates all of its probability 
on the former. This reinforces our finding that 
absorbing sets containing Altruists are the lim- 
iting outcomes in the absence of mutations, as 
long as there is some initial probability of al- 
truism in a large population. Our model thus 
differs from many mutation-counting analy- 
ses, in that our limiting distribution does not 
depend critically on highly improbable se- 
quences of mutations and hence need not in- 
volve extraordinarily long waiting times. 

If we consider large populations, mutations 
will ensure that there are more than 60-percent 
Altruists in the population, though less than 100 
percent. The exact calculation of the limiting dis- 
tribution is tedious, but we can establish some 


bounds. The calculation of these bounds is sig- — 


nificantly simpler for the case of u < 1, though 
u can be arbitrarily close to one.'® 

The argument proceeds by noting that if the 
number of Egoist strings is too small, then the 
Egoist strings will be far apart and most Al- 
truist strings will be long. A mutation will then 
tend to strike in the midst of Altruists and cre- 
ate a new string of Egoists, increasing the 
_ number of Egoists. If there are many Egoists 
strings, then these strings will be relatively 
close together, separated by short Altruist 
strings. Mutations will then often strike suffi- 
ciently close to two Egoist strings as to give 
rise to imitation dynamics that merge the two 
Egoist strings, thereby reducing the number of 
Egoists. We thus expect a centralizing ten- 
dency in the number of Egoists. We have: 


'8 The advantage of » < 1 is that all of the absorbing 
sets under the random imitation dynamics are then single- 
tons. This makes it easier to do the necessary calculations 
(and reduces the number of calculations) of the probabil- 
ities that mutations transform a given absorbing set into 
the basin of attraction of another. 























PROPOSITION 6: Let u < 1. Then the 
of the limiting distribution, as the pop 
size gets large, restricts probability to ah 
ing sets in which the proportion of Altry 
between 70 percent and 87 percent. 


IV. Larger Neighborhoods ` 


We have assumed that agents interact i 
with their immediate neighbors. This segg 
examines an extension of the model tha kgs 
lows us to make the following point: decra ti 
ing the cost of altruism can be bad.ten 
Altruists. 


are of ‘‘radius two’’ in this case. 

As in the previous case, the cost of altruiz 
plays a crucial role in shaping the results, ¥i 
study two intervals for the parameter t} 
namely (4, *4) and (*%, 1).'? Changin 
value of C within such an interval does Æ 
affect the outcome, while we shall see that} 
two intervals give different behavior.” Wek 
u = 1 throughout. : 

The investigation of the model with nes 
borhoods of radius two and costs C € (‘e!% 
begins with a calculation of transition 1 
The nontrivial conditions under which af 
truist will remain an Altruist are the follo 
cases and their mirror images, where th 
truist in the center is the agent in questio! 





1 When agents iriteracted only with their i ee 3 
neighbors, the only relevant cost consideratio 
whether C was larger or smaller than '/. 

2 If C > */,, then altruism is so costly that only 
survive. The results for 1 < C < */, are qualitativ 
ilar to those for */, < C < 1, with one quantitative 
ence noted below. Cost levels '⁄ < C < “ls 
similar to those of /, < C < ĉe. Costs C < ‘h 
ticeably different and more complicated behaviof 
do not investigate here. Cost levels that lie at 
aries between these various intervals create com 
arising out of cases in which the average payoffs 8 
ists and Egoists are equal. 


wg, 
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hen for a strategy that could be either A 
€ popy 
ity to ab 
of Alt Eaaa a xEEE 
ent. 
aaaa a EEEx 
ods 7 aaaa a aEEE. 
interact $ ` ED or hich an Egoist will become an 
Sas, é cases 1n W 
This m Inu ist are the following, as well as their mir- 
odel Se one 


ages: 


oint: 

be badi _qaaaE E  EEEx 

each Alt Eaaa E EEEE 
c good t aaaa E EEEx. 
Jach ager 5 
of his fouro gnese allow us to prove:”' 
es the striai. jee... 
» with the FROPOSITION 7: Let neighborhoods be of 
neighbo dius two and leth < C < 1. Then absorbing 
ie. sets generically consist of (i) the state in which 
cost of al al agents are Egoists and the state in which 


jal agents are Altruists, and (ii) sets contain- 


paramete ing states in which strings of Altruists of length 
’ Changing SE ve or more are separated by either strings of 
terval dogs thfée E's or blinkers, where blinkers consist 
hall see tha emately of one E and five E’s or consist 


ravior.” WABE alternately of two E’s and six E’s. With the 


& eiception of the state in which all agents are 


del with j4 $, the proportion of Altruists is at least 

ists C E : 

Transition; `, ; 

ər which proof mimics that of Proposition (1) and 

re the fo tted. To obtain the minimal proportion 
where lists in the stable sets that contain Al- 


-Wẹ note that we can pack blinkers that 

eretate between two and six E’s next to each 

Sher with five A’s between them, in the fol- 
Ing Way: ‘ 


+ a@aaaaEEaaaaaEEEEEEaaaaa::: 


4s, . 
Eenerically”” in this statement allows us to avoid 
Within the interval (5/,, 1) that create payoff ties 
Altruis ts and Egoists. For 1 < C < 5/,(C #), 
> Same characterization of absorbing sets, ex- 
Pike tS must be separated by at least six Altru- 
S © minimal percentage of Altruists 0.6. 


hat lie at Of 
create come, 
‘age payo. 


. the following: 
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This guarantees a maximum of Egoists, and 
here we have a proportion of 10/18 = 54 
Altruists. 

For costs in the interval (*/4, °4), we have 


PROPOSITION 8: Let neighborhoods be of 
radius 2 and let */, < C < */s. Then generi- 
cally, absorbing sets include (i) the state in 
which all agents are Egoists and all are Altru-. 
ists, and (ii) sets containing states in which’. 
strings of three or more Altruists are separated 
by strings of exactly three Egoists, or by blink- .. 
ers which alternate between one and five or ` 
between two and six Egoists.” Except for the 
state in which all agents are Egoists, the pro- 
portion of Altruists in an absorbing set is a 
least 'h. 


The proof again mimics that of Proposition 
(1). To obtain the lower bound on the number 
of Altruists, note that it is possible to arrange 
blinkers in the following way: 


( 8 ) -- aaaEaaaEEEEEaaaEaaaEEEEEaaaEaaaaa: >- 
++: aAEEEEEaaaEaaaEEEEEaaaFaaaEEEEEaaa: -> . 


This has '/ of the population as Altruists. 
There is no denser way to arrange blinkers. 

In each case a result analogous to Proposi- 
tion 2 holds, establishing that if agents’ initial 
identities are independently determined and 
may be altruistic, then as the population 
grows, the probability of convergence to a 
state in which altruism survives approaches 
unity. 

The lower bound on Altruists is lower for 
C € (*4, `l) than in the case of higher costs, 
being '/, rather than °/. In this sense, it can be 
disadvantageous for Altruists to have their al- 
truism come too cheaply. The forces behind 
this result are revealed by comparing (7) and 
(8). Example (7) reflects the fact that when 


ae 


2 If two one/five blinkers are separated by a string of 
only three Altruists, then the blinkers must be out of phase, 
so that the state in which one of the blinkers has five Ego- 
ists is the state in which the other blinker has only one 
Egoist. A two/six blinker requires at least five Altruists 
on each side. 
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costs are relatively high, strings of Altruists 
must be at least five Altruists long in order for 
Altruists to receive payoffs high enough to 
survive. Example (8) reflects the fact that for 
lower costs, Altruists’ payoffs are higher and 
shorter strings (of length three) of Altruists 
can survive. The imitation dynamics can then 


-* Tead to outcomes in which islands of Egoists 


are separated by strings of only three rather 
than five Altruists, and hence a smaller pro- 
portion of Altruists. 


V. Conclusion 


We have shown that if players choose their 
strategies in games by imitating successful 
players, and if there is a local or neighborhood 
structure to both the interaction between 
agents and their learning, then altruistic be- 
havior can survive. 

Imitation and the local nature of the inter- 
actions are both important to this result. Best- 
response learning would immediately lead 

` agents to adopt the dominant strategy of Ego- 
ist. Agents who choose their strategies by im- 
itating others will imitate Altruists, but only if 
the latter happen to be earning relatively high 
payoffs. The role of the local interaction struc- 
ture is to allow Altruists to huddle together in 
concentrated groups. The benefits of the public 
goods supplied by Altruists are then enjoyed 
primarily by Altruists, leading to higher pay- 
offs than those of Egoists, who tend to be sur- 
rounded by other Egoists. 

A group of Altruists is always a ripe target 
for invasion by a mutant Egoist, who will 
thrive on the public goods provided by the Al- 
truists. For this reason Altruists can survive, 
but they generally cannot conquer. Instead, the 
Altruists will be riddled with pockets of Ego- 
ists. However, there are limits to the expansion 
of egoism. As more and more Egoists try to 
free ride on nearby Altruists, the payoffs of 
Egoists fall and imitators become Altruists. 
The result is the preservation of altruism in 
coexistence with egoism. We can hope for al- 
truism, but not for a perfect world of altruism. 

Attention is drawn to the importance of the 
mutations in a local interaction model by re- 
sults from biological studies of group selec- 
tion. Group selection models typically assume 
that agents are arranged in isolated groups cre- 



































ated by either spatial separation or kine 
lationships.”* Payoffs are measured in tate) 
expected numbers of offspring, and theses 
portion of the population playing a re ix 
high-payoff strategy increases bec 
agents playing that strategy have re 
large numbers of offspring. As in our 
of learning by imitation, thé important 
erty of this reproduction-based dynamic; 
cess is that if the agents in a location te 
be predominately Altruists, then altruism 
spread to nearby locations, as the conce: 
Altruists earn high payoffs and hence 
many offspring that spill over into neighbiity 
locations. ey 
Kin selection theories are now widely: 
cepted as explanations for some seemingly: ne 
truistic behavior.” However, group seleg 
models that are not based on kinship rel 
ships have been criticized [e.g., Willing 
(1966); Dawkins (1976)], initially beca 
the mechanism that caused some groups è greg 
grow faster than others was not specified, 
subsequently because the combination k gen amili 


thought to be implausible.” 


*8 See, for example, Vero Copner Wynne-Edwed 
(1962, 1986), W. D. Hamilton (1964, 1972), Geog 
Williams (1966), Eshel (1972), David Sloan Wi 
(1975, 1987), Dan Cohen and Eshel (1976), C. Mat 
and S. D. Jayakar (1976), Maynard Smith (1976); 
Eshel and L. L. Cavalli-Sforza (1982). 

* For example, kin selection arguments have beeni 
to explain the behavior of several species of tropical 
terflies, some of which incur a cost to develop a bittet! 
that discourages birds from preying on others [ 
Pierson Brower and Jane Van Zandt Brower. (1% 
Brower (1969), Woodruff W. Benson (1971), al 
(1972)]. Kinship relationships play an especially 9 
tant role in explaining the behavior of the social ins 

35 Robert Boyd and Peter J. Richerson (1985) # 
that group selection arguments may be applicable # 
plaining the evolution of altruistic behavior among 
mans. They examine a model in which people haves 
for conformity, so that altruism is a strict best f 
long as sufficiently many other people cooperate =y 
ology, group selection arguments are often invoke.” 
plain the inefficiency of weapons used in compel! 
mates, such as excessively branched or curv! 
(Konrad Lorenz, 1963), and are used to expl 
imposed limits on reproductive ability when 4 p° 
is threatened by overpopulation. Wynne-Edwards 
suggests examples of the latter phenomenon, and é 
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‘model differs from a typical biological 
rag up selection in that our agents are 
ged in overlapping rather than isolated 

as.- This overlapping-neighborhoods 

a opens the possibility that mutations 
ng Egoists into our model can favor 


“gisrupti and eliminating strings of Altru- 
<and ae causing two groups of Egoists 
and then shrink to a single, small group. 
contrasts with typical biological group se- 
yn models, where migration and espe- 
‘mutation work relentlessly against 
and one must struggle to find a plau- 

ghle explanation for why the migration and 
Wa gutation rates are sufficiently small to allow 
ee y to survive. In addition, mutations in 
sar model can introduce Altruists in the midst 
af] Egoists. In biological models with isolated 


mutation rates that are unrealistically high and 
sizes that are unrealistically small. In the 


specified -prezence of overlapping groups, relatively 
mbination, mali numbers of mutations allow Altruists to 
and infre gain a local foothold from which they can spill 


rt altrui ver. 
iterpret our analysis as a biological model 
with local interactions and dynamics based on 
production and emigration, providing a new 
planation for altruism that exploits the over- 
group structure. 
model of agents occupying locations 
g Sond a circle is very simple. What happens 
are placed in a plane, or in a higher- 
‘asion structure? To gain some insight into 
tases, recall that Altruists fare poorly 
‘exposed: to many Egoists, while Egoists 
Well when exposed to many Altruists. 
ig agents to be arranged along a circle 
ae scutes that any group of A’s cannot have too 
Altruists who are on the boundary and 
sare exposed to Egoists, and ensures that 
Eup of Egoists cannot have too many 
tS exposed to Altruists. This in turn 
$ conditions under which Altruists are 
‘thrive> Moving to the plane or to 


1 (1971), a 
in especial 
f the social 





rehavior ama, 
n people hav 












often invo. 
‘din comp ig ve 
d or curio 
ai to explteg by Frank Fenner (1965) (Mixoma viruses, 
ty when a p eee abbits), John J. Christian (1970) (rodents 
nne- Edwards pecunmals ), and Frank M. Stewart and Bruce R. 
7 t) (bacterial viruses). 


menon, and i 
Ñ 


-while being detrimental to Egoists, ~ 


groups, such a possibility is thought to involve — 


aver Into nearby locations. We could thus re- ~ 
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richer spaces raises the possibility that groups 
of Altruists will appear that are irregularly 
shaped and that expose virtually all of:their 
members to Egoists. These Altruists may then 
not survive, and the persistence of altruisnr-ap- 
pears to be less certain. 

' The extensive simulations of Nowak and 
May (1992, 1993) and Nowak et al. (1994) 
suggest that in the absence of mutations, there 
are many initial conditions from which a sig- 


` nificant proportion of Altruists persist. Once 


again, Egoists in their model do well in the’. 
midst of Altruists while Altruists do poorly in 


the midst of Egoists, and concentrated groups _ - 


of Altruists can then expand. The dynamics are 
much more complicated than in our simple 
model, but altruistic behavior typically 
survives. 

What if the game contains more than two 
strategies? Altruists can survive in our model 
because the Altruists near the end of a long 
string of Altruists earn higher payoffs than do 
the Egoists near the end of long strings of Ego- 
ists. When we extended the argument to gen- 
eral 2 X 2 games, the criterion for expansion 
turned out to be a mixture of efficiency and 
risk dominance. In games with more than two 
strategies, the criteria for whether strategy x 
can expand at the expense of y,.1.e., for 
whether agents on the boundary between 
strings of x and y will switch to x, involve 
pairwise efficiency and risk-dominance con- 
siderations. If there is a strategy that is rela- 
tively efficient and does not fare too badly in 
pairwise risk-dominance comparisons with all 
other strategies, then the system can converge 
to states featuring primarily that strategy. 
However, cyclic behavior can also appear in 
which strategy x expands at the expense of y, 
y expands at the expense of z, and z expands 
at the expense of x. . 

Imitation may often be important, but adopt- 
ing the strategy observed to earn the highest 
average payoff is a very simple decision pro- 
cess. What about imitation rules other than 
simply comparing average payoffs? For-ex- - 
ample, agents may base their choices not only 
on average payoffs but also on the number of 
agents they observe playing each strategy. 
This introduces elements familiar from the lit- 
erature on strategies for the infinitely repeated 
prisoner’s dilemma. Tit for tat, for example, - 
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simply adopts the previous strategy it has ob- 
served, The implications once again depend 
.. upon the behavior of an agent located at the 
end of a string of similar agents. An Altruist 
at the boundary between sufficiently long 
strings of Altruists and Egoists observes equal 
numbers of-.Altruists. and Egoists among~her 
opponents, as does the adjacent Egoist. If ob- 
serving the Altruists in this sample makes 
agents more likely to cooperate by being Al- 
truists, then our results are reinforced. If ob- 
serving the Egoists makes the agents more 
likely to retaliate by being Egoists, and if this 
is sufficient to overcome the average payoff 
advantages of Altruists, then our results will 
be reversed and altruism will vanish. The suc- 
cess of altruism then depends upon whether 
agents facing both Altruists and Egoists tend 
to see their glasses as half full or half empty. 

A great deal of work remains to be done in 
extending the analysis to larger games as well 
as more complicated spatial structures and 
learning rules. It is clear, however, that dy- 
namics driven by imitation can differ signifi- 
cantly from the familiar best-reply dynamics 
and that imitation coupled with local interac- 
tions opens the possibility for altruistic behav- 
ior to survive. 


APPENDIX 


PROOF OF PROPOSITION 1: 

It is immediate that the states in which all 
agents are Altruists or all agents are Egoists 
are absorbing states, because imitation cannot 
~” introduce Egoists into a world in which there 
are only Altruists, or vice versa. 

To find the remaining absorbing sets, con- 
sider what happens to a string of A’s as the 
imitation dynamics proceed. From (2), any A 
string of length one immediately: disappears. 
Similarly, if we have an A string of length two, 
the two A’s in this string immediately become 


EE E aa aE E aa 
xE a EE xE E EE 
EE E EE xx E EE 
























E’s. In the process, however, the adjaceni ss: 
may switch to A’s. What happens to thesi's 

jacent E’s? There are four possibiliti 
following transitions describe the fate of 
(the center agent in each case) that ini 
sits just to the left of the string of two 4 
similar analysis holds for the E-on the 


ascertain (see below). PaA 
Moreover, the x’s in the final line can b Ay } 
only if there existed a string of three or ngs] 
A’s to the left of our segment, to which 
agents have now become attached. Hence. 


longer? From (1)—(2), the A’s at the end a j 
such string are the only potential candidat] 
for becoming E’s, and the only way that sik 


of { —2, —1, 0, 1, 2}. Because the string ca f 
increase in length only if it borders a segm8 gc. 
of three E’s [ from (1)], the string cam f= 
merge with any other A strings of length th 9 f 


for such a string. It ean persist forever a 
distinct string, perhaps varying in F ori 


riods without giving birth to new strings. WY 
thus have that strings of A’s can be destroytt 
but cannot be created. i ac 

Together, these results give: There exists tsi 
time T such that the number of A strin "a8: Cont: 
time T is less than or equal to the numbet 
A strings of length three or more at time 
the number of A strings in any subsequent} 


riod is equal to the number at time T; aNG? Ble i rv 
A strings in subsequent periods are lenf QER sPosttios 
three or longer. peers Of len 
ods 

ow 

aS isat] 

es 

Salt 

Ea E aa aa E aa 4 S St 
; E E a 

EE E EE xE E EE vile. ae 
EE E EE xx E EE res Scnt: 







































what can we say about E strings? First, no- 
phat the number of A and E strings must 
aed equal. Next, suppose that time 7 has been 
b so that all A strings have length at 
_ Then from (1), any E string the 
e hich is more than two declines in 
genath by two, a string of length two retains 
ae , and a string of length one increases 
by two. Hence, we will eventually 
gave Egoist strings of length two or blinkers, 
qhemating between lengths one and three, but 

É o longer strings, giving: There exists a time 
fter which the number of E strings is less 
<shan the number of E strings in the initial state 
is constant, and E strings either remain at 
kagth two or alternate between lengths one 
! and three. This gives Proposition 1.1. Itis now 
F an easy calculation to check that since A 
strings occur in lengths at least three, and since 


foy 4 zations between length one and three, that the 
. Heng groportion of A’s, if there are to be any A’s at 
ge in lenge all, must be at least 0.6. 

e string tay A 


brs a segis 
string caiggy PROOF OF PROPOSITION 2: 
f length IW ft is immediate that the system must con- 
i § verge to a state containing Altruists, and hence 
forever) NYM &ftate containing at least 60-percent Altruists 
length, oR (by Proposition 1) if there exists a persistent 
e point, cak & and that the probability of a persistent 
> next (won Ag approaches unity as N gets large if initial 
icatities are randomly, independently deter- 
Hied, with positive probability on Altruist. It 
&n femains to verify the characterization of 
CIS stent strings. We examine the case of a 
g containing at least five adjacent A’s. 


the numb k Sbowing that the remaining strings identified 
e at time: (21) are persistent, and that any other string 
ubsequ minated by period three, involves 


itforward variations on this argument. 
Osition 1 has already shown that every 
AB of length two or less is eliminated within 
Periods.) 


LAS at least five, cannot disappear. In par- 
© show that if there exists a string of 
at time t, then either all five of these 
ust also be Altruists at time £ + 1 or 
all be Altruists at time ¢ + 2. This 
Wgardless of the strategies played by 
SSents in the system. 


E strings occur in either length two or alter- - 


‘Show that a string of A’s, the length of - 


i 
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Suppose we have a string of five or more 
A’s bordered on each end by an Æ. Each of 
these two E’s must have either an A or E on 
its other side. This gives us four possibilities 
to consider. First, suppose each Æ has an A on 
its other side. Then from (1)-—(2), the system 
proceeds as follows: aes 


---a@E aaaaa Ea... 


-++ EE Eaaak EE: 
Ex. 


-xE aaaaa 


As usual, an x holds the place of an agent who 


may be either an Altruist or an Egoist. For con- 


venience, the original string of five A’s is sepa- 
rated by spaces. A similar result clearly holds if 
the original string contains more than five A’s. 

Alternatively, one of the E’s on the end of 
the string of A’s may have an E on its other 
side while the other may have an A on its other 
side. This gives us the following case and its 
mirror image: 


-++@E aaaaa EE- 
EE Eaaaa xE.: 
xE aaaaa xxt.. 


Finally, the Ẹ’s on both ends of the string 
of A’s may be bordered by £’s. Then we have: 


-EE aaaaa EE-:: 
‘++ Ex aaaaa xE. 


In each case, the result is that any string of at 
least five Altruists persists. 


PROOF OF PROPOSITION 3: 

Let there be countably many agents, de- 
noted by the integers. Consider the initial state, 
and-suppose that, in this state, agent 0 is the 
rightmost agent of one of the following se- 
quences of agents: 


aaaaaE EEaaaaE EEEaaaE 
aaaaEE EEaaaEE 
aaaEEE. 
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Let such a string be called a ‘‘persistent-string- 
with-right-boundary.’’ Then we know that 
contained within agents {—6, --, —l} isa 
persistent Altruist string. 
` We now let 7 be an integer with the property 
that, in the initial state, the set of agents {0, 
-- w- T} contains a persistent string. Our task is 
to calculate an upper bound on the expected 
value of 7.” Because the identities of the 
agents in {0, 1, =} are determined indepen- 
dently, each with probability p of being an Al- 
truist, we can describe the initial condition, 
and hence the expected value of 7, by defining 
anew Markov process as follows. Let there be 
17 states, denoted by {1, 2, ..., 16, 7}, where 
we think of the first 16 of these states as being 
associated with the following sequences: 


1: aE 7: aEaa 13: aEaaaE 
2: EE 8: EEaa 14: aEaaaa 
3: EEE 9: EEEaa 15: aEaaaEE 
4: aka 10: aEaaa 16: aEaaaaE. 
5: EEa 11: EEaaa 

6: EEEa 12: EEaaaE 


We define the state at step 0 as being one of 
states 1, 2, or 3, depending upon whether 
agents —2, —1, and 0 are characterized by xaE 
(x € {a, E}), aEE, or EEE. We define the 
state at step t > 0 to be state T if any agent 
from the set {0, ..., t} is the rightmost agent 
in one of the following sequences: 


aaaaa EEaaaa EEEaaa 
aaaaEE EEaaaEE 
aaaEEE . 


If the state at step ¢ is not T, then the state is 
given by i € {1, ..., 16} if agent ¢ is the right- 
most agent in the sequence corresponding to 
state i, and the same is true of no other i € {1, 


26 The assumption that N is infinite makes its appear- 
ance here. It allows us to calculate the expected value of 
7 while ignoring the possibility that the set of agents is 
exhausted before encountering another persistent string, 
with our search for such a string having taken us around 
the circle and back to our point of departure at agent 0. 
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.., 16} with a longer sequence. Intyj¢ 
the system begins at step zero in one o 


then examine individuals 1, 2, 3, and sọ On, 
each case using that individual’s identi; 
either Egoist or Altruist to define a tr 
of the new Markov process. The latter ¢ 
state T (for ‘‘terminal’’ ) whenever an indi 
ual has been encountered who allows 
confirm the existence of a persistent s 


ayers i 
Ni i 


Aes 


rhood. | 


j Pae EE bers of 
Markov process are calculated on the baş hood. / 


the assumption that agents in the initial- 


probability p and Egoists with probabili 
p. it is then straightforward to calcul 
expected number of steps to absorption in 
T from each of states 1, 2, and 3 (see E. Senex: 
1981 Theorem 4.5). Numerical calculati 
produce a table matching that given in Prop § 
osition 3, where these numbers are the mat $: > 
mum of the expected number of steps front $ ¥ 
three initial conditions given by states 1, 2,a# fig 
3.7 These figures are upper bounds on thet 
pected number of agents, between the end oft 

persistent-string-with-right-boundary and &B 
end of the next persistent string to the rig ® 
The boundary of the persistent-string-wilt 

right-boundary may contain up to three Ef 
ists, but a persistent string must contain atl 
three Altruists, so these numbers are also 
per bounds on the number of agents beii 
two persistent strings of Altruists. 


PROOF OF PROPOSITION 4: 

Let the types of players be denoted by 18: 
2. Let each player i have a set of players ghee § 
he potentially imitates, called his leaf 
neighborhood, and a set of players with We: 
he interacts, called his interaction neight® 
hood. In particular, player i’s imitation ; 


2? The details of these calculations are avaiable ; 
quest. Differences in the expected number of 4 A ; ts 
these three initial conditions are small. The ™4* 5% 
such difference appears for large values of p, 44 
for p = 0.5 but only 0.3 for p = 0.1. 
























aihe strategy that receives the highest 
pa off of the strategies represented in 
ing neighborhood.” In our model of 
¿and Egoists, the interaction neigh- 
of player i included his two nearest 
while his learning neighborhood in- 
ese two nearest neighbors and him- 
41 and N? be the sets of type-1 and 
“A :players in agent i’s learning neighbor- 

Tet N! and NP be the sets of type-1 
4756-9 players in agent i’s interaction 
np a ni! n£, n! , and n? be 
ümbers of players in the sets Nj', Ni’, 
d NË. Let P; be player i’s payoff, and 
a | + n? = n' be the size of the interaction 
séiahborhood. According to the imitation rule, 


for this ta 

2 the basig 
e initial! T 

Altruist 


= Kın’ + Kyn? —= C, 











k 


= (K; agi K,)nj' + Kan’ = C; 





' his how obvious that when K, — K, > 0, the 
cs will be identical for all pairs of types 
3) for which (C, — C,)/(K, — K) is the 


F OF PROPOSITION 5: 
£ denote the state in which all agents 
: Let A be the state in which all 


fer i is excluded from his interaction neighbor- 
included in his learning neighborhood. In the 
and Egoists, this simply reflects our 
the costs of altruism as the net costs, 
ts of one’s own public good provision have 
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agents are Altruists. Let X(n, m) denote the 
collection of absorbing sets with the property 
that in any state contained in such an absorbing 
set; at least some agents are Altruists and all 
agents are Altruists except n strings of Egoists 
of length two and m blinkers, where n = 0 and 
m = 0. We define X(n, m) only for values of 
(n, m) for which X(n, m) is nonempty. Then 
A is the unique element in X(0, 0) and every 
absorbing set other than Ẹ is contained in 
some X(n, m). 

It suffices to show that D(A) < D(F), 


-where D is defined in Lemma 3 of Samuelson 


(1994). For this, it suffices to show that: 


¢ Three mutations suffice to transform £ into 

a state in the basin of attraction, under the 

imitation process, of a state in X(n, m) for 

some (n, m). 

Given any absorbing set in X(n, m) with 

(n,m) + (0, 0), there exists a state in the 

absorbing set which a single mutation can 

transform into a state in the basin of attrac- 
tion, under the imitation process, of an ab- 
sorbing set in X(n', m’) with n! + m' < 

n+morwithn’ <nandm =m+ l. 

e Given any state in any absorbing set in X(n, 
m) for any (n, m), it takes at least N/10 
mutations to reach a state in the basin of 
attraction, under the imitation process, of £. 


A State is in the basin of attraction, under the 
imitation process, of an absorbing set, if the 
deterministic imitation process (without mu- 
tations) leads from the state to the absorbing 
set. 

To establish the first condition, we need 
only note that if three mutations introduce 
three adjacent Altruists into state £, then Prop- 
osition 2 ensures that we then have a state in 
the basin of attraction X(n, m) for some (n, 


m). To establish the second condition, con- . . 


sider an absorbing set S’ in X(n, m). If m > 
0, then we need only choose a state in S’ 
which at least one blinker has only one Egoist. 
A mutation switching this Egoist to an Altruist 
then produces a state in absorbing set in X(n, 
m — 1). Hence, consider an absorbing set in 
X(n, 0). Now let a mutation switch an Egoist 
to an Altruist. The result is an isolated Egoist, 
(that was adjacent to the Egoist affected by the 
mutation). The next iteration of the imitation 
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process will produce a string of three Egoists. 
If all Altruist strings are still of length at least 
three, then we have a blinker and a state in an 
absorbing set contained in X(n — 1, 1). If 
. instead at least one Altruist string is now of 
length only two, then (from the proof of Prop- 
osition.L.) that string of, Altruists will disap- 
pear, while no new string can appear, yielding 
a state in an absorbing set in X(n’, m’) with 
notm' <n. 

Finally, we calculate a lower bound on the 
number of mutations required to convert a 
state in an absorbing set in X(n, m) into a state 
in the basin of attraction of £. The mutations 
must eliminate all of the strings of Altruists in 
the original state. We first notice that in order 
to eliminate a string of A’s of length k, we 
` must have at least [k/5]—the integral value 
of k/5—mutations.” A lower bound on the 
number of mutations needed to eliminate all 
string of A’s is then N/10, which arises in the 
case in which there are strings of A’s of length 
nine (which are the longest that can still be 
eliminated by a single mutation) with blinkers 
at the end of the string, where the blinkers are 
in phase and there are nine Altruists in the 
string when each blinker consists of a single 
Egoist. For sufficiently large N, and in partic- 
ular for N exceeding 30, this number exceeds 
three, giving the result. 


PROOF OF PROPOSITION 6: 

Fix the population size N. Let the Markov 
process induced by the imitation dynamics be 
(S, P), where S is the state space and P is the 
transition matrix, and let the Markov process 
induced by the imitation-and-mutation dynam- 
ics by (S, IT), where I is the transition matrix. 
We say that an agent chosen to assess her strat- 
egy, under the random imitation dynamics, has 
“‘received the learn draw.”’ 


2° This number is calculated by observing that if an 
Egoist is placed in the midst of a string of Altruists, the 
result is a blinker, with three Egoists in the next period. 
In order to eliminate a string of A’s, enough Egoists must 
be inserted so that after a period has passed and each Ego- 
ist given rise to a string of three Egoists, with blinkers 
possibly also converting the A’s at each end of the string 
into E’s, all remaining strings of A of the original string 
must be at most of length two. This requires at least [k/ 
5] mutations. 


can occur only if the probability attach 






























ay 
Step 1: This step shows that instead g 
amining (S, I), we can work with a s 
Markov process (K, A). To constry 
simpler process, we let [N/5] denote 
tegral value of N/5 and let the state 
K = {0, 1, ..., [N/5]}. We interpret. 
k € K as identifying the number of 
strings in an absorbing state of (S, P) ®% 
transition matrix is A, where A, is the mi BAe 
ability that a single mutation in (S, T); 44 
lowed by the imitation dynamics, leads 
an absorbing set with 7 Egoist strings tg 
absorbing set with j Egoist strings. Noti 
a mutation can create at most one new Epi 
string or can destroy at most one string yy. 
hence can cause the number of Egoist s 
to change by at most one. The proporti 
Altruists in the limiting distribution of ( 
matches the proportion in the limiting 
bution of (S, T). 
Step 2: We now examine (K, A). This 
birth-death process, since from state k, ths 
is positive probability of moving only to stg 
k—1,k,andk + 1. The stationary distributa 
6* of a birth-death process must satisfy thee 
tailed balance condition: 


O*(kK) Anais 


a) Gey i 


To complete the proof, it suffices to show t 
there is e > 0 such that for any N, if 2k/N! ET y: 
0.13 (recall that each Egoist string coni \yeSsumize 
two Egoists), then Ayip+)/Apaiy > 1 +4 ays g: 
and if 2k/N = 0.30, then Ay, + :/ArsuS i er 
1 — æ. In particular, this ensures (from 
that the ratio 6*(k)/5*(k + 1) is bounded 
low one when 2k/N = 0.13 and bo 
above one when 2k/N = 0.30. As N g0 
the number of pairs (k, k + 1) with 2k/N 
0.13 and 2k/N = 0.30, and hence the nul 
of pairs for which these bounds on thes 
ary distribution hold, approaches infinity. “ 


pi 
7 


%° The details of this construction, as well as OF: 
culations from Step 3, are available on request. Sioe, 
such string must contain at least two Egoists and we 
separated from other Egoist strings by at least thre" 
ists, there can be at most [N/5] such strings. 



























es k such that 2k/N € (0.13, 0.30) 
ee, 3) 


unity. oe 
“This step verifies the required in- 


< Recall that absorbing states consist 


ae a more Altruists. We first calculate a 
af und on A,,+1. A mutation creates a 
g of Egoists with probability one if it 
45 to egoism an Altruist who is bordered 
sefeast four Altruists on each side; with 
ity between zero and one if the Altru- 
rdered by three Altruists on one side 
at least four on the other; and otherwise 
Sih probability zero. In light of this, we can 
Šad a lower bound on the probability of increas- 
isthe number of Egoist strings by arranging 
‘geents so that there are eight Altruists between 
dach Egoist string, leaving one longer string of 
fefiaver Altmuists, and assuming that a mutation 
jaserting an Egoist between three Altruists on 
čas side and four on the other never creates a 
gew string of Altruists. The probability of intro- 
k: dacing a new Egoist string is then bounded be- 
§ kaa by the probability that a mutation strikes an 
Fseeat more than four Altruists away from the 
ead of the long string of Altruists, or 


A+ (N — 10k). 


2 

; N 

A similar calculation shows that the proba- 

aliy “of introducing a new Egoist string is 

§ @atimized if strings of Egoists are separated 
/ 8¥ Stings of only three Altruists, giving an 
bound of: ” 


We now turn to the probability of eliminat- 
ist „strings. An upper bound on the 
of eliminating such a string is: 





= 1 
Ais. = Prous 


Binmore and Samuelson (1997) for a sim- 





à reflects a three-Altruist buffer at both 
String of A’s in which a mutation cannot 
string of Egoists. 


: of two Egoists separated by strings” 


a “w 
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A lower bound on the probability of eliminat- 
ing Egoist strings is given by: 


(A6) 


We use these calculations to obtain: 


Axes _ N— 10k 


Akri SK +1) 


if k/N = 0.065 (and hence there are no more 
than 13-percent Egoists), N is sufficiently 
large, and e < 0.075. Similarly, 


Anns = 


Aus tx 


N-S5k-39 2 
2(k+1) 8 





l-e 


if k/N = 0.15 (and hence there are at least 30- 
percent Egoists), N is sufficiently large, and 
e < 0.06. This gives the result. 
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When Does It Take a Nixon to Go to China? 


By ALEX CUKIERMAN AND MARIANO TOMMASI* 









Substantial policy changes (like market-oriented reforms by populist parties and | 
steps towards peace by ‘‘hawks’’) are sometimes implemented by ‘‘unlikely”’ - 
parties. To account for such episodes this paper develops a framework in which” 
incumbent politicians have better information about the state of the world than 
voters. The incumbent is unable to credibly transmit all this information since 
voters are also imperfectly informed about his ideology. This paper identifies 
conditions under which an incumbent party’s electoral prospects increase the 
more atypical the policy it proposes. Popular support for a policy, or its ‘‘cred- 





ibility,’’ depends on the policy maker-policy pair. (JEL D7, D8, E6, C72) 


The history of public policy contains several 
episodes in which structural reforms or im- 
portant economic or foreign policy shifts were 
implemented by parties or policy makers 
whose traditional position was to oppose such 
policies. Argentina under (Peronist) Menem, 
Peru under Fujimori, and Bolivia under (pop- 
ulist) Paz Estenssoreo underwent profound 
market-oriented economic reforms.’ France 
privatized some of its public sector and shifted 
the emphasis of economic policy to price sta- 
bility during the 1980’s, under socialist Pres- 


* Cukierman: Eitan Berglas School of Economics, Tel 
Aviv University, Tel Aviv 69978, Israel, and Center for 
Economic Research, Tilburg University, Tilburg 5000 LE, 
The Netherlands; Tommasi: Universidad de San Andrés 
(1644) Victoria, Buenos Aires, Argentina. The Amnon 
Ben Nathan Chair in Economics at Tel Aviv University, 
the Harvard/MIT RTG in Positive Political Economy and 
CIBER, and the Academic Senate at UCLA provided fi- 
nancial support. We benefited from useful suggestions of 
three anonymous referees. We are indebted to Olivier 
Blanchard, Greg Hess, Eric Rasmusen, Tom Piketty, and 
seminar participants at Boston College, Brown University, 
the University of Chicago, Dartmouth College, the Euro- 
pean Center for Advanced Research in Economics at the 
Universite Libre de Bruxelles, the University of Geneva, 
Groningen University, the Innocenzo Gasparini Institute 
for Economic Research at Bocconi University, the Uni- 
versity of Limburgh, MIT, the Universidad de San Andrés, 
the University of Strathclyde, Tel Aviv University, the 
University of Tilburg, and UCLA for helpful comments, 
and to Carola Schenone for research assistance. 

! Dani Rodrik (1993) points out that it is ironic that 
populist and interventionist parties have implemented rad- 
ical trade liberalizations, fiscal adjustments, and market- 
oriented reforms. 
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ident Mitterand. In the late 1970’s, after, 
years of vehement opposition to trading ly 
for peace, hawkish Israeli Prime Mini 
Begin yielded the entire Sinai sare 
turn for peace with Egypt. His partner tog 
historical deal was President Sadat of Ea $ 
who is considered to be the first Arab leii 
to mount a relatively effective military cë 
paign against Israel. Having establish 
strong and persistent anti-Communist má 
during the 1950’s and 1960’s, President 
then opened the door to the international 
gitimization of the People’s Republic of 
in the early 1970's. 
These episodes should not be interp 
imply that large shifts in policy can be 
mented only by political parties having 
torical bias against such policies. Privatizlit gs 
and other reforms under Thatcher are an 0! 
counterexample. But the examples in the 
ceding paragraph raise an intriguing and 1m; 
tant question about the circumstances W a! 
which policies are implemented by “ 
political parties rather than by parties the 
ogies of which favor such policies. Our ob}: 
in this paper is to identify conditions 
which shifts in policy are more likely to 
plemented by improbable characters. T 
end, we first develop a political economy 
work in which such a phenomenon can 
and use it to pin down a set of conditions %3; 
makes it more likely that such policies ‘ce 
implemented by the ‘‘wrong’”’ parties. 
We then develop an explanation for “ 
reversals” within the framework of as 






















































no. 1 C 


ion about the mapping of policy 
ts into policy outcomes.’ Incumbent 
normally have better information 
eneral public about the likely out- 
of alternative policies. Governments 
with public policy issues on a daily basis, 
‘eae have access to the advice of specialists, 
ea fi some cases, they possess classified in- 
mation. Furthermore, since information 
“Seflection is costly, a large part of the voting 
has neither the incentive nor the ability 
“become fully informed about all the aspects 
inplex public policy issues.’ This idea is 
captured here by assuming that a key stochas- 
parameter of the mapping of policies into 
"aromes is observed by policy makers in of- 
2 but not by the voting public. 
People’s welfare depends on outcomes, and 
policy choices affect outcomes. However, out- 
" games are also influenced by external circum- 
sages about which policy makers normally 
F; wave better information than does the general 
fi goblie Thus, depending on external circum- 
:.gaaees, a right-wing policy may or may not 
3 be desirable from the point of view of a ma- 
EY sity of the population. Suppose that it is, and 
E f3t-the incumbent party is fully informed 
resident Nig aut this. In order to implement a policy, the 
ternational partyin office has to elicit support. To that 
sublic of Calli ##e, it has to transmit to the public its private 
“Mi Seenation about the relative desirability of, 
ase, right-wing policies. When the in- 
tis a recognized left-winger, his ability 
‘and to implement the required policies 
than the ability of a right-wing in- 
The reason for this apparent incon- 


pninsulaing 
partner to 

adat of Egy 
st Arab Ie i 
military: 
establishe 
umunist T 


are Thomas Gilligan and Keith Krehbiel 
Alesina and Cukierman (1990), Kenneth 
). Joseph E. Harrington, Jr. (1993), John 
954), and Christian Schultz (1996). David 
a (1993) emphasizes that legislation (or pol- 
ally) is a means to an end rather than.a final 
of these authors also postulate, as we 
nt parties have better information about 
of the mapping of policies into outcomes 
general public. 
À ine. leads to specialization in knowledge 
OW more about some things than others) 
ignorance. This idea goes back to Anthony 
4 .10r recent treatments see Arthur Lupia 
"San (1993), and John Matsusaka (1995). 
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gruity is that the public has less reason to 
suspect that the right-wing policy is proposed 
solely because of the natural ideological ten- 
dencies of the party in office, i.e., it may be 
perceived as an objectively motivated policy. 

Another important element of our frame- 
work is the inability of voters to fully distin- 
guish policy shifts that are due to changes in 
(information about) the mapping of policy into 
outcomes from those that are due to intraparty 
politics.* This inability prevents the public 
from making precise inferences about govern- 
ment’s private information concerning this 
mapping (the state of the world). The reason 
is that the policy proposals of incumbent gov- . 
ernments are affected by shocks to this map- 
ping as well as by changes in the incumbent 
party’s preferred policy position. The second 
type of change makes the proposed policy a 
noisy indicator for the state of the world. 

Some conditions make a policy reversal 
more likely. The policy switch that—in view 
of external circumstances—is desirable © 
should be considerable and relatively rare. 
These conditions appear to have been satisfied 
in the episodes mentioned before. Major eco- 
nomic reforms, the trading of land for peace, 
and the opening of a pathway towards China 
are policy decisions of a scope that occurs in- 
frequently. Furthermore, policy reversals are 
more likely when the voting public has sub- 
stantial uncertainty about the governments’ 
exact preferences. In addition, the final out- 
comes of the policies under consideration oc- 
cur far in the future—that is why the voting 
public has to use the policy proposals of in- 
cumbents as signals for the likely future out- 
comes of the proposed policies. This feature is 
formalized by assuming that voting takes place 
before the realization of final outcomes.° 

The basic ideas of the paper are presented in 
Section I with a representative-democracy model 
in which, after receiving some information 


~ r nh 


4 Michael Laver and Norman Schofield (1990) have 
stressed the effects of intraparty politics on policy choices. 

> This timing is a crucial difference between our model 
and that in Harrington (1993). Harrington uses an other- 
wise similar informational structure to derive implications 
from voter uncertainty to policy manipulation for reelec- 
tion purposes. 
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.. about the mapping of policies into outcomes, 
the incumbent party commits to a policy plat- 
form.° This is followed by elections. If the in- 
cumbent is reelected, he implements the 
proposed policy; if another party is elected, 
it picks the policy that is nearest to its own 
preferences given the realization of the sto- 


chastic component of the policy-to-otiicomé ~ 


mapping.’ 

We show in’Section II that moderate right- 
wing policies are more likely to be imple- 
mented by right-wing parties (and similarly for 
the left), but extreme right-wing policies are 
more likely to be implemented by left-wing 
parties (and vice versa). Section III contains 
comparative statics. In Section IV, we discuss 
some implications for credibility. Section V 
concludes, pointing to the next steps in this 
research agenda. 


I. The Model 


The economy consists of a large number of 
individual voters with different preferences 
over a single policy issue. The utility of a type 
j voter is given by: 


(1) -|£ = œ+ y| 


where x is policy, c; is a constant, and y is a 
normally distributed stochastic variable with 
zero mean and variance o2. The superscript e 
attached to x and y denotes expected values of 
these variables conditioned on the information 


® We are assuming that the incumbent takes an action 
that commits him to a future policy (like making a state- 
ment or sending a bill to Congress) thereby revealing part 
or all of his private information to the public. Although 
the action taken by the policy maker might include a ver- 
bal statement, it is not ‘‘cheap talk” in the sense of 
Vincent Crawford and Joel Sobel (1982). As stressed by 
Cukierman and Nissan Liviatan (1991), announcements of 
future policies by incumbent politicians are not necessarily 
costless from their point of view. Thus, although our an- 
alytical structure bears some resemblance to models of 
information transmission in debates—such as those in 
Austin-Smith (1990, 1992)—it is based on costly rather 
than costless signalling. 

7In Cukierman and Tommasi (1998), we explore the 
same ideas in the context of a referendum game (direct 
democracy). The broad conditions for the type of reversal 
discussed in this paper apply to both institutional 
structures. 


































available to the voter. (c; + Y°) is the 
ceived) ideal policy of a type J Voter, 
pends on the type-specific “‘taste’’ pans 
C; as well as on the voter’s perceptio 
realization of an exogenous state-of-najyi 4 
rameter, y, that induces a unidirectional go 
in the preferred policies of all voters, 
`y is meant to capture the effect of eyt 
circumstances on the ideal policies of y 
Voters have well-defined and stable 
ences over outcomes, but the mapping of fa 
icies into outcomes, and hence the in 
utility function over policies, has a stog 
element. 

People generally have different prefe 
opinions about the desirability of altem 
policies. But when they learn that ther 
been a shift in exogenous circumstan 
voters shift their preferred policies in the sy 
direction. To illustrate, consider the ‘‘landig} 
sus peace’’ issue in the ongoing negotiaia ai 
between Israel and Syria. Different indivi 
in Israel have different opinions about | 
much territory to give up for a peace of a gira fF 
quality; some would give up very little m 
others a lot. In the language of economics, tag 
possess different marginal rates of subst 
tion. But the policy preferred by each indie 
ual (how much land to give up) also de 
on his or her percéption of the ‘‘deal’i tf 
Israel can work out with Syria. Again if 
language of economics, preferred policies 
depend on the exogenous marginal rail’ i 
transformation between land and peace. Y 
they believe that it is possible to obtaas 
higher-quality peace for a given amount olf 
ritory, all Israelis advocate more. dois ¥ 
policies, although hawks are still wilgi 
give up less territory than doves. The bea, 
geneity of preferences is captured by ¢& 
common effect of perceived external ¢ 
stances by y°. 

In a representative democracy, voters 0 
choose policy directly. Instead, they @ ue 
elected officials who decide what policy? 
low. We model this institutional setup ! 
tulating two parties, the right R and the 
that compete for office. Each party cares 
the issues as well as about being in offi 
se (this Downsian component is call 
rents by Rogoff and Anne Sibert, 198 
use A to denote this ‘“‘value of office: 































wo. 1 


o Incumbent 
announces 
platform 


a Incumbent 
observes 
y and 6: 


as a stochag 

g. 
nt preferen 
that there} 


to denote the (deterministic part of the) 
öint of party i = L, R, with c} < cp. 

jdea well established in political 
@e-—see Otto Kirchheimer (1966) and 


umstances am and Schofield (1990) —is that different 
ies in the Sag és cater to the interests of different but 
‘the “Tang A buous groups of constituencies. The ideal 
ig negotiati y of the left-wing party reflects a com- 
ent individ fyomise between the different leftist groups, 
ms about\b@lMM cad aimilarly for‘the right-wing party. The rel- 
eace of agi ‘ability of each such group to affect the 
very litib EES ‘party's policy position is usually in a state of 
conomics, Ia. d not fully known by the general public. 
es of subi We model this effect by postulating that the 
y each indi -esefetred policies of parties are subject to in- 
)) also deM Baparty shocks labelled s; i = L, R. The 
re ‘‘deal! 


4 all: tions of these shocks, which are inde- 


a. Again q tly and normally distributed with mean 

ed polici & 270 atid variance 2, are known to the parties 

arginal 1 the public. In addition, as is the case 

ad peace: ofers, the parties’ ideal points also de- 

zle to ob Y. Those factors are incorporated by 

n amoun lating that the bliss point of party i is 
: (Gi + e; + y), and its objective func- 

still will l 

res. The i 

red by cj h- |x — (c, te, + y)|. 

*xternal 


stic component, c;, is common 


cy, voters% & but the stochastic component, s; 
ad, they wn by the public. Similarly, the re- 
hat polic: -Of y is observed by"the incumbent 
ial setup, Not by the general public. This is a 
R and the: : y of expressing the presumption 
party cares gg RA ment has a more precise notion 
ing in OUR ne general public about the effect 
at is cale gr 4s cumstances on the way in which 
ibert, 17 ~Ments map into outcomes. Since 
of offic at party has its own policy pref- 
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o Incumbent reelected 
è Implements platform 


ee eee 


o Challenger elected 
o Picks his preferred policy: cr+ x+y 


FIGURE 1. SEQUENCE OF EVENTS 


erences, it is not able to transmit fully this in- 
formation to voters. But the voting public 
normally learns something about the realiza- 
tion of y from the incumbent’s policy 
proposal. 


A. Timing, Information, and Elections 


It is convenient to divide the sequence of 
events, illustrated in Figure 1, into two peri- 
ods. At the beginning of the first period, the 
incumbent observes y and his own «. He then 
makes a proposal x. After having observed the 
incumbent’s proposal, the general public-votes 
for or against the incumbent. In the second and 
last period, public policy is carried out. If the 
incumbent is reelected, he carries out the pol- 
icy proposed in the first period (the proposal 
is a binding commitment). If the challenger 
gets elected, he picks the policy that maxi- 
mizes the ex post value of his objectives 
(knowing y and his own e). 

This way of modeling the interaction be- 
tween the voting public and government is de- 
signed to capture the fact that a policy has to 
satisfy two conditions in order to be imple- 
mented. First, the government has to propose 
the policy; second, the proposal should draw 
sufficient public support. Obviously, the 
model can be interpreted literally as stated in 
the previous paragraph. But it is also possible 
to interpret the model mote broadly, as cap- 
turing the fact that a policy cannot be imple- 
mented if it does not gather sufficient public 
support or if it raises too much opposition. Un- 
der such a broader interpretation, a ‘‘vote 
against the incumbent’’ would refer to a state 
in which the proposed policy is abandoned af- 
ter some trial period for lack of sufficient 
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public support (even in the absence of formal 
elections), while a ‘‘vote for the incumbent” 
permits the continuation of the policy. 


Note that there is an asymmetry between the ` 


incumbent and the challenger. While the first 
commits to a policy prior to elections, the 
‘second—if elected——gets: to choose his pre- 
ferred policy. This asymmetry reflects the 
presumption that reputational and other consid- 
erations make it more difficult to adjust policy 
for the incumbent than for the challenger.’ 

At election time, each individual votes for the 
party that he perceives is going to implement a 
policy nearest to his own preferred policy. But 
this depends, in turn, on the individual’s percep- 


tion of the exogenous shock y. Knowing that the . 


incumbent’s proposed policy partially reflects 
that party’s private information about y,. each 
voter utilizes governmental policy as a signal for 
(in a way that will be made explicit below). 

Let g(c;, w) be the fraction, or density, of 
voters with preference parameter c;, where w 
is a stochastic shift parameter. Given the re- 
alization of w, the distribution of c; is nonsto- 
chastic, so that electoral uncertainty is 
captured by the distribution of w. Each reali- 
zation of w induces a different nonstochastic 
distribution of c;, each with its own median Cm. 
The median c,, is, hence, a function of w; we 
assume that it is a linear function, and that w 
has a uniform distribution. This implies that c,, 
also possesses a uniform distribution. Let ¢ 
and c be, respectively, the upper and lower 
bounds of this distribution. 

Since voters’ preferences are single-peaked, 
the outcome of the elections is determined by 
the preferred policy of the median voter. The 
party whose expected policy position 1s near- 
est to the ideal policy (Cm + y^) of the median 
voter on election day wins the elections. Un- 


8 Policy is known to react to opinion polls even be- 
tween elections. See Benjamin Page and Robert Shapiro 
(1983) for U.S. evidence. These considerations are devel- 
oped in more detail in Cukierman and Tommasi (1998), 
with a focus on developing countries. 

? Daniel Ingberman (1984) and M. Daniel Bernhardt 
and Ingberman (1985) analyze the notion that the plat- 
forms of incumbents are more reliable indicators of their 
future policies than are the platforms of challengers. 
Schultz (1996) discusses the case in which both parties 
commit to their respective policy platform. 































certainty about the distribution of voter 
points induces probabilistic voting-—ths 
ability of reelection of an incumbent È 
in general, strictly between zero and 9 
Peter J. Coughlin, 1992).!° 


B. Equilibrium 


Suppose, for concreteness and withoy} 
of generality, that the incumbent is pay 
Since policy must be chosen prior to electi 
and since the election outcome is uncemg. 
the incumbent party takes into consideri 
the effect of the current policy choice op% 
probability of reelection. More precj 
chooses policy x, so as to maximize: `? 


(2) P'x)[h -— |x: — (cr + ext yI] 
tao P*(x,)] 


x [-|EGely) — (cr + II. 


P*(-) is the probability that L will be rely 
and depends on policy choice x,. The sous 
term in the incumbent party’s objective LE SA 
utility it expects to obtain if it loses the electa ame 
and policy is chosen by the challenger.” $$ 


from R is E(xp|y). A 
The functional form of P+(-) depends ont 
way the voting public forms its percepti@t 
y. However, the formation of this percep! 
depends on the (optimal) policy rule of thew 
cumbent which depends, in turn, on r 
equilibrium, the policy rule that the publicp® 
tulates in order to form perceptions of 7! 
to be consistent with the actual policy rule 
lowed by the incumbent, and the publics 


10 It is commonly accepted that probabilistic Y 
more realistic than deterministic voting. For &? 
Randall Calvert (1986 p. 54) points out that “tis 
in harmony with the importance attached by walt 
political scientists to the role of imperfect info 
politics.” 

1! We assume that the intraparty shock £: 
persistent.” That is, it persists into the postelectio? 
only if the incumbent party is reelected. Besides 
fying the algebra considerably, this assumption "yr 
descriptive realism since success in the elections” og 
increases the durability of a given deviation from 3 
bliss point, c,, while failure normally reduces ! 












































No.1 


ormation process assumed by the 
thas to be identical to the actual pro- 
eof expectation formation. A full definition 


prium follows. 


‘on 1; An equilibrium is a-pair of pol- 
metions, XÈ Y> £1) and xg(Y, Er), together 
yoters’ beliefs, y*(x,), such that: 


The incumbent party chooses policy (prior 

g elections) so as to maximize (2). 

elected, the challenging party chooses 

icy after elections so as to minimize 

a (Cr + ER + y) | . 

iven his perception of y (and of the policy 

‘af the challenging party), the median voter 
s for the party whose expected policy is 

pearest to his ideal point. 

Voters’ perceptions about y (and about the 

policy of the challenging party, if elected) 

ae formed rationally, using all the available 


- 


‘information. 


lf the right-wing challenger is elected, he 
lets-the policy that maximizes his ex post 
tives, namely, he implements policy: 


Xr = Cr t Er +t y. 


policy expected by voters from R, prior 


® ëlections is, therefore: 
Xk = E(xr| x) = ce + 7’. 


se {hat the policy expected from R depends 
Policy proposed by the left-wing incum- 

-f to elections. The reason is that the 
Sf policy by both R and L depends on 
. at voters receive information about y 
choice of policy by the left-wing in- 
aS More complicated, since he has to 
Consideration the effect of his current 
= m voters’ expectations and, through 
3 ae the probability of reelection: Using 

‘vane the fact that E(é,) = 0 and that E(xp| 


H the objective function (2) can be- 


Dlh = Ie = (er + ez + WI] 


H — PU) — co). 
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The equilibrium solution for x, is obtained 
by the method of undetermined coefficients. 
We first postulate that the equilibrium choice 
of x, is the following linear function of y and 
EL: 
(6) X, = B, + byy + br 
where B,, b;,, and br, are coefficients to be 
determined. Observation of x, by voters does 
not enable them to disentangle the effect of y 
from the effect of e; it is easy to verify that 
this implies that b}, = bzs = bz. Therefore, (6) 
simplifies to 
(7) x, = B, + bily + &). 

Voters know the decision rule in (7), ob- 
serve x; prior to elections, and use it to im- 
prove their forecast of y. Since B; is aknown 
combination of parameters, it is easy to show 
that the expected value of y, conditional on 
X,, is given by (Morris De Groot, 1970 p. 
169): 


x, —B 
@) y= Ela) = 0 E 
L 
where 
2 
©) dg o2 T o? 


Since the policy of any incumbent becomes 
more right-wing the larger the y (it will be 
shown that b, > 0)—and this is known by the 
public—a more right-wing policy (a larger xz) 
is partially interpreted by the public as being 
due to a larger y. But this signal is noisy be- 
cause the incumbent’s policy choice is also in- 
fluenced by the intraparty shock. For this 
reason, the impact of x, on y° is weighted 
by 0. 


-ASSUMPTION 4: We assume that voters 


believe that the policy proposed by the in- 
cumbent L is always to the left of that of the 
challenger R, or: x1, = xı < Xp. The assump- 
tion states that party R is always perceived 
to be the right-wing party. Conditions on 
the model’s parameters for the fulfillment 
of this assumption are presented in the 
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- Appendix (see also Alesina and Cukierman, 
-~ 1990). 


Assumption 1, together with the decision 
rule of the median voter, implies that there 
exists a critical value of cm, denoted c‘,, such 
that if cm = c£, the incumbent L wins the 
elections, and if cm > c%, the challenger R 
wins the elections. The value of c$, is ob- 
tained from: 


[xz = y° — ch) = Ixk - y° ehl, 


which, due to Assumption 1, is equiva- 
lent to: 


(10) Cy? + ch) — XL = Xk — (YF + Ch). 


Rearranging (10) and using (4), we 
obtain: 


(11) cE, = 3 (cr + x — y’). 


The probability P“(x,), that the left-wing in- 
cumbent is reelected, is equal to the probabil- 
ity that the ideal point, c,,, of the stochastic 
median voter falls to the left of c£, Since Cm is 
uniformly distributed, this probability is given 
by“: 


(12) P(x) = 





Using (8) in (11) and the resulting expression 
in (12), we obtain: 


(13) P“) 


1 B, 
= ——— | cr — 2c +0 + dx, |, 
ron : b, | 


where d = 1 — 6/b,. 

Substituting (13) into L’s objective function 
(5), we obtain the following first- and second- 
order conditions for an internal maximum: 


Obviously, the probability function is bound to be- 
long to [0, 1]. In Part 2 of the Appendix, we analyze what 
happens when c < c and when cf, > ¢. 








Case 1: If x, > C, + E + y, the fing. c=] 
second-order conditions are given res a 

by: 

(14) d(h = X + EL + Y + Cp) 
ina — XE- o)P*(x1) = 0, 


(15) -d < 0. 


Case 2: If x, < c + ez + y, the first 
second-order conditions are given respects, 
by: at 
(16) d[h + X — (2c, + Ex + y) + ca] : 


+ (E — APH) = 0, 
(17) d<0. 


i 

Rearranging the first-order conditions, EE wa 
obtain: f 
Case l: (xL > c, + €, + y). 


B, ts 
b, 


1 : 
(18) XL = 34 [er + h) = Cr — § 


1 
+ 5 + €L). 


Case 2: (x1 < c, + & + 7). 


B 
(19) ay = 55 | 22 on 85 
+ d(2e, — cr - h 
per inens 
A 2 Y T EL, 


Equating coefficients across equa 
(19), and (7) we obtain: 


(20) by = 


NI- 



















H NO: J C 


e firs 
respect? y 
1 7 qy 
i [ee on z) 
R) g? 
+t eo (1 2 z) | 
Jo i Def Oe 
for x, > Cu + & + y (Case 1). 
g? 
he firs 5 [e 7 cn ji 2) 
1 respect i 
+ (2c, — Cr~ a(t = 2) 
Os 
y) + i | for x, < Ci + e, + y (Case 2). 
Equation (20) implies that (in both cases) 
ie oo 
A o+ o? 
H, Fabat the second-order conditions for a max- 
zondition A =ar in the two cases are, respectively: 


2 


o, <o}  forx>c+e,+y. 


o > o} forx,<c,+e,t+y. 
a (13), (15), and (17), P“(x,) will be in- 
sng in Case 1 and decreasing in Case 2. The 
#502 underlying the two cases is the follow- 
:When L moves policy tothe right, he triggers 
conflicting effects on his reelection prospects. 
Y, this moves him closer to the center 
political spectrum and increases his prob- 
sF Of reelection. We shall refer to this as the 
Wing effect.” But the shift of policy to the 
: alid raises the forecast of y. This moves the 
of all voters to the right and increases 
“Ora prospects of the challenging right- 
ty. We refer to this as the “expectation 
When o2 < o2, this shift of policy in- 
4 Moderate reduction in the reelection pros- 
x -vla the expectation effect, and the 
ane effect dominates, raising the reelection 
the left-wing incumbent. It therefore 


“ak Choose a policy that is more centrist 





Policy (c; + e, + y). When o2 > 
©1Pectation effect dominates and a shift 
Ces, on balance, L’s reelection 
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prospects. It therefore chooses a policy that is 
more extreme than its ideal point. 

It is shown below that in both Cases 1 and 2, 
a right-wing incumbent (RWT) is more likely to 
propose a relatively right-wing policy. In Case 2, 
as policy shifts to the right, the probability of re- 
election of a left-wing incumbent (LWT) falls and 
that of a RWI rises. Hence, policy reversals (i.e., 
higher chances of a right-wing policy being 
adopted by a left-wing incumbent) are ruled out 
in Case 2. Since we are interested in the condi- 
tions determining policy reversals, we concentrate 
on Case 1 in the following. o?, < o% is, therefore, 


-one of the conditions required for policy rever- 


sals. A relatively small o? means that the mass 
assigned to the tails of the normal distribution of 
y is small in comparison to the tails of the distri- 
bution of e; (i.e., that the probability of y taking 
extreme values is small in comparison to the 
probability of s; taking extreme values). 


Il. Which Party is More Likely to Implement 
Which Policies? 


We now come to the central issue, stated in 
the title of this section. To do that, we focus 
on a comparison between the behavior of left- 
wing and right-wing incumbents. 

A derivation equivalent to that of the pre- 
vious section, for a right-wing incumbent (for 
g? < a?) leads to: 


XR = Br +3(y + ER), 


with the mean (equilibrium) policy choice of 
a right-wing incumbent being 


1 2 
22) Be=5 [(: 2 “Nie —h) 


2 
$ (1 $ 2) 00 - e| l 


The probability of reelection of a right-wing 
incumbent as a function of his (individually 
optimal) policy choice is: 


(23) P*(xe) 


l 








an 
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Notice, comparing Case 1 of (21) and (22), 


' that as the value of office, h, increases, both 


parties’ policies converge to the center (the 
Hotelling effect). On the other hand, as the de- 
gree of electoral uncertainty increases (a larger 
Z and/or smaller c), the policy of R becomes 
more right-wing (and that of L becomes more 
left-wing). Finally; cz and c, also enter with 
the expected sign. 

Let PI'(x) bė the probability that incumbent 
party i implements policy x, and Q'(x) be the 
probability that incumbent party i proposes 
policy x. For a policy to be implemented, it 
has to be proposed by the incumbent party 
and, given this proposal, the incumbent party 
has to receive public support via reelection. It 
follows that the probability of implementation 
of policy x by party i is: 


PI(x) = PP), 


where P'(x) are the probabilities of reelection 
as stated in (13) and (23). 

Notice that x; = B; + 'h(y + e;) implies 
that 


Xi NIB;, VJ, . 


with V = (03, + 02)/4. Hence: 


— R)2 
Qe) = nv) exp|— aera | 


To simplify the calculations, we confine 
ourselves, henceforth, to the symmetric case: 
in which ¢ = —c > 0 and cp = —c, > 0. This 
implies that 





2 
a\h ot. 
(1-2) Fe 
At+dx A-— dx 
= d PE) =-——— 
P(x) 7 (x) a 





where 


2 251 T 

A=2(1-2 c+ 1-= | & 
Ge oe + o?) a4 

o(a? — o?) 


(abt oye 























Notice, as before, that the (average) policje 
each of the parties contains the followin 
tors: an electoral uncertainty effect (Bij 
ing in ¢), a Hotelling-Downs effec. 
decreasing in h), and an ideological prefer: 
effect (B increasing in c,).’* E 

Let 


be implemented by a right-wing party thak 
a left-wing party; the converse is true Wa 
API(x) < 0."* Let a 


LEMMA 1: Sign[API(x)] = sign[FQ)}- 


PROPOSITION 1: If Vd/B < A <%® 
the range of policies x can be partitioned 
the following way: 


(1) There is a central region (x, x) in 
the conventional result obtains (poli 
to the left of the center of the pol 

Bee 


! We assume, henceforth, that the mean po" 
posed by a RWI is larger than the mean policy P 
by a LWI. That is, Be > B,. This is the case WE 
systematic difference between the ideological PO" 5 Jims 
the two parties is sufficiently large in compariso ae enc 
(shared) love of office, h. In the symmetric cash 77. i 
plies that By is positive, B, is negative, and that 
positioned at equal distances from the center oft 
ical spectrum (which is at x = 0). ' , 

14 Notice that we concentrate on comparing 
bilities of implementation attached to differe: 
bents. We believe that this is the relevant comp? 
in our (two-period) model, only the incumbent 
an interesting strategic way; the behavior of ¢ 
passive. 
































m are more likely to be imple- 
ented by L, and policies to the right of 
zo center are more likely to be imple- 
ted by R). i 
licy Reversals) There is a region out- 
(x, X) in which very left-wing policies 
< x) are more likely to be implemented 
R, and very right-wing policies (x > x) 
are more likely to be implemented by L. 
ly Nixon) For even more extreme val- 
above x} and below x}, (where 
> Xand x} < x), only the unlikely party 
can implement policy x (i.e., P®ix > x8] = 
and P*{x < x1] = 0). 


ctr 


fs of Lemma 1 and Proposition 1 are pro- 
j in the Appendix. Figure 2 summarizes the 
ion of the proposition. Panel A shows the 
f probabilities of each proposal, Q(x). Panel B 

-Saw the probabilities of winning the election 
ga function of the proposal, P'(x). Notice that 
picture of PE(x) = Px), shown in 
èl C, can be obtained by increasing the mass 
the right tail and decreasing the mass of the 
if tall of Q'(x), and conversely for R. Panel D 


HI. Comparative Statics 


Figure 2, Panel C, suggests that, in general, 
selatively extreme policies are unlikely to be 
‘==plemented by either party. The reason is that 
eme realizations of the stochastic compo- 
6f the mapping of policies into outcomes 
the intraparty shocks are relatively rare. 
et, when such events do appear, the 
bility that the corresponding extreme 
Will be implemented by an ‘‘unlikely’’ 
is greater than the probability that it will 
lemented by the “likely” party. 
vely, the conditions that are condu- 
Policy reversals can be summarized as 
8: First,-the variability of intraparty pol- 
erences has to be large in comparison 
variability in the mapping of policies 
Omes. This assures that when the 
2 Poposal of the incumbent party shifts 
& hee center, the Hotelling effect dom- 
a *Peclation effect, thus increasing 
; amci 2 prospects of the party. Second, 
Rec. + more likely to occur the more ex- 
licy that is being proposed. Since 
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extreme policies: are proposed infrequently, 
policy reversals will also be infrequent events 
that will be associated with extreme and rela- 
tively rare realizations of y. 

In order to gain additional insights, we will 
now perform comparative statics to see how 
the range of reversals, the complement of (x, 
xX), varies with the parameters of the model. 
We know that x, 0 and x are the three roots of 
F(x) = 0 (although only the location of x and 
X varies with the underlying parameters).'° Let 
a denote any parameter. Applying the implicit 
function theorem to F(x) = 0, 


dx 1 2x |. OB OV 
C4) a ROA a 
ðA ad 
Oa 2a 
A — dx 
ðA ôd 
ea eee 
_ Oa da 
At+dx |’ 


where x = x, 0, x. 
Applying (24) to the electoral uncertainty 
parameter č, we obtain 


2 
2(1 rs 3 
dx x E 
u S F'(x) V 
g? 
a(i -3e 
T A? =, (dx? 


'5 Note that the probability of implementation of the 
policy x = 0, at the center, is identical for both parties. 
The fact that F(0) = 0 is a consequence of the postulated 
symmetry, but the general qualitative nature of Proposi- 
tion 1 does not depend on the symmetry assumption. 
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A. PROBABILITIES OF POLICY BROPOSALS BY 
INCUMBENTS L AND R 


’ 


PRE). P) 





x Xp 


o 


B. PROBABILITIES OF REELECTION AS A FUNCTION OF 
POLicy PROPOSALS 





C. PROBABILITIES OF IMPLEMENTATION: 
PI) = QOP 





D. F(x) = PI(x) 


— P(x) 


FiGURE 2. PROBABILITY OF POLICY IMPLEMENTATION BY 
EACH TYPE OF INCUMBENT 


Notice that 


<0 atx=x<0 
=0 atx=0 
>0 atx=x>0 


x 
F'(x) 





and that the expression in braces in (25) is pos- 
itive. This leads to: 



















RESULT 1: An increase in electoral 
tainty (measured by C), reduces the range 
“policy reversals.” < 


Similarly, ~° 





Since (03/02) < 1, (1 — 2(03/02)6) is posig 
It is easy to see the following: 


RESULT 2: An increase in the deg 
party polarization or ideological distance] 
tween the parties (measured by Cp), decreta 
the range of ‘‘policy reversals.’’' 


sured by 0 2), increases the range ø n ; JO 
reversals.’ 


we are studying should be more commoa# 
countries characterized by ‘‘catchall’’ pute 
that comprise a wide spectrum of relai L. 
heterogeneous constituencies, or by coals pe 4) 

governments within which there are ag? ¥ 
ments (deals) which are not easily observ 
the general public. The Peronist movem# 
Argentina fits the first interpretation quite™* # 
and some of the broad ‘‘unity’’ govem 

in Israel conform to the second. i 


IV. Implications for Credibility” . 


Politicians usually justify their policy, 
posals by claiming that the state of the 


'6 Schultz (1996) addresses other effects of Pom 
in a model that also emphasizes asymmetric 18 
about a parameter that he calls “the functionisé, 
economy,” which plays the exact same role a5 “s E 
1? The notions of credibility that we use in ot + ays 
are by no means the only ones possible. Alte... TA 












































a o 

one that Gif believed) would elicit 
os um public support for their proposed 
es. In order to elicit support for their 
“e politicians argue that those policies 


AeA oial for the majority of the popu- 

sion. In the context of our model they 

o ‘i d. argue that y is quite large (small) 

aa “they propose a right (left)-wing 
e 18 


; ate section, we use two notions which 
$ sre the credibility of (implicit) statements 
4 ost Y by looking at the posterior beliefs 
E about y held by voters after having observed 
aalicy proposals. We shall continue to refer to 
a sy | olicy that is to the right of the center of 
§ ág political spectrum as ‘‘right wing’’ and to 
aay policy that is to the left of this center as 
deft wing.’ (For the symmetric case, the cen- 
af the political spectrum is at 0.) 
$ 3. The credibility of a given policy maker when 
; Be proposes a relatively left (right)-wing pol- 
f sey can be characterized as the posterior prob- 
4 ability that y is smaller (larger) than a given 


2)) is past 


the degred 
‘al distanced 


) Cp), decr 
+316 


RA 
yO 
0 


acertain y B vzlse of y. l 
positioni (Maa -The public’s observation of the policy pro- 
ange of 5} foal x, given its knowledge of B; and of b; = 


a {s equivalent to an observation of (y + 


k =-2(x — B;), fori = L, R. We are interested 


policy t sia inferences about the unknown value of y 
Jore comma =de from the observation of (y + s;). This 
catchall’ 'ipa petscivation is a random drawing from a nor- 
um of reli distribution with an unknown mean y and 
., or by cosi ante o2. The prior distribution over that 
there are” anjwnh mean is N(0, o2). Using Theorem 1 
asily obs section 9.5 of De Groot (1970 p. 167), the 
nist movema EnO distribution of y, f(y, x) is N(yi(~), 
station qui here yi) = Oy + 8) = 26x — B), 
ty” governs = ozo Mo + 02). 

md 3 7 


‘redibility p$ 
H 

i their polit J. 

state of the : 
wo, 


Credibility are discussed in Chapter 11 of 
wman. (1992). See also the discussion of Allan 
Paul Masson (1994) in Section V of this paper. 

pone (1994); political parties make announce- 
out “the way the economy works” (y in our 

J dthen Propose policies. In his model, there is 
t tat ¢ rgence of policies (the median voter the- 


r effects of. ~ inlas parties announce different y’s in an 
symmetri uence people’s reduced-form preferences. 
‘the functi model the formation of beliefs explicitly. 
same role a mapping of announcements into be- 


deduce the belief-formation process from 
r- yesian) behavior of voters. 
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fay, x) 





Vx) 9X y 


` FIGURE 3. POSTERIOR BELIEFS ABOUT y, AFTER POLICY 


ANNOUNCEMENT xX 


Recalling that B, < Ba, it is easy to see that 
¥R(x) < yilx) for all x. This means that any 
given policy, x, is interpreted by the public as 
being associated with a lower (posterior mean) 
value of y when this policy is proposed by a 
RWI than when it is proposed by a LWI. The 
intuition is simple. Being aware of the system- 
atic ideological differences between the two 
parties, the public expects to observe the same 
policy from both incumbents only when the 
RWI observes a lower value of y than his left- 
ist counterpart. 

The posterior distribution that the public as- 
Signs to y after observing a proposal x, is de- 
picted (for the symmetric case) in Figure 3. It 
is clear that, for any given x, 


(26) fily, x) < fry, x) for y < 0x, 


fy, 9) >fely,x) fory > 6x 


and 
Fi(y, x) < Frly, x), VY, 


where F is the cumulative distribution function 
of f. 


RESULT 4: (Credibility of implicit state- 

ments about y across policy makers) For any 

given (common) policy proposal: (i) a left- 
wing policy maker is more credible, than a. 
right-wing policy maker when he claims that 

right-wing policies are desirable, and (ii) a 

right-wing policy maker is more credible than 

a left-wing policy maker when he claims that 

left-wing policies are desirable. 


2. Consider now ‘‘extreme’’ policies. Ex- 
treme policies are defined as policies that are 
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to the right of the average policy proposal 
made by a RWI or to the left of the average 
policy proposal made by a LWI. Politicians 
can convince a majority that an extreme policy 


_ 2 ds necessary by claiming that there has been 


an extreme realization of y. We therefore mea- 
sure the credibility of policy proposals by the 
posterior probability assigned by the public to 
the event ‘‘y is larger than a sufficiently large 
and positive y’” when an extreme right-wing 
policy x is proposed——-and by the posterior 
probability assigned by the public to the event 
‘ty is smaller than —*’’ when an extreme left- 
wing policy is proposed. 

Consider a policy x > B > 0 and its mirror 
image (in the symmetric case) —x, and let 
y > 20(B + x). We know that yi) = 
26(B + x) and that y{(—x) = 26(B — x). It is 
easy to see that B > 0 implies 


yix) — (=f) Sy yia 


that is, deviates from yj(x) less than (—ŷ) 
deviates from y{(—x). This yields 


(27) [1 — Fil¥, x] > FẸ; ~x). 


A similar derivation for a right-wing incum- 
bent yields: 


(28) Fr(—}, =x) > [1 — Fal¥, x)]. 


In words, (27) and (28) imply: 


RESULT 5: (Credibility of a given policy 
maker across extreme policies) (i) A left- 
wing policy maker has more credibility when 
he proposes a significant policy shift to the 
right than when he proposes a significant 
policy shift to the left. (ii) A right-wing pol- 
icy maker has more credibility when he pro- 
poses a significant policy shift to the left than 
when he proposes a significant policy shift 
to the right. - ete 


V. Conclusions 


When governments have better information 
than do voters about the way in which policies 





























map into outcomes, policy proposals Ae i, 
information. This may lead to situation per: 
which extreme, but rarely proposed, polis. í 
are more likely to be implemented by 
likely’’ actors. A necessary, but not suffi 
condition for such reversals is that the Unie 
tainty about the preferred policy posit 
parties be~sufficiently large COMparéd ty g es = 
uncertainty about the state of the world, . 
A main result of the paper is that polj 
versals are more likely following the realy, 
tion of extreme and relatively unlikely vals 
of parameters that map policy choices into og basi 
comes. A corollary to this result is that pig Sse 
reversals occur infrequently. Bs 
This logic can be applied to several poly | ` 
arenas. In Cukierman and Tommasi (1998jk Bera 
apply it to the specific issue of market-orieng 
reforms undertaken by many countries g 
the last decade. We argue there that politicies $; 
coming from the left of the spectrum, wa jie 
faced with the fact that such policies ia 
necessary, had a comparative advantage i $% 
convincing people of the long-run necesi £ 
of these changes, even if they hurt now. Tha 
is some preliminary evidence in favor of 
point. John Williamson asked the cont. 
butors to his 1994 edited volume to check 
conventional wisdom that market-oriented# 
forms are creatures’ of right-wing gover 
ments. Little support was found for suchë] 
association, as summarized by Williamson 
Stephan Haggard (1994). As a matter of fat 


oriented reform implemented by what 
classified as right-wing governments. In n: 
ingly, these three cases. included the two, 
itary dictatorships in their sample (C 
Korea). This seems consistent with the | 
tion that, under democratic conditions, 
shifts in policy are more likely to inv ji 
versals of a party’s traditional policy: 
tion.'? In Cukierman and Tommasi (199 


(1997) argue that the implementation of reform 
might suffer from time-consistency problems. ayes 
benefit from early reforms but suffer from later ont Faiy 
blockade the later stages, making some reform pe“ 

inconsistent. An implication of the logic of y 
that policy makers may, then, initiate a reform >% 
by implementing those measures that hurt thers 
constituencies. 












































sals: me cases, other than those included 
ane Williamson volume, which display sim- 
e 


wie. A characteristics. 
ted bya me well-known foreign policy reversals 
nsistent with the framework of this 
“tench as the opening up to the People’s 
blic of China by staunch anti-Communist 
Nixon, and the trading of land for 
hawkish Israeli Prime Minister Begin, 


ove. Begin managed to overcome 
g the nd sometimes violent, opposition to the 
likely Aaa @santling of Jewish settlements in the Sinai 
“is that Feninsula largely because he had a long and 


as a hawk helped him convince voters 


sever oe eat agreement was beneficial for a majority 
z WBA: af Israelis. If the same policy had been adopted 
arke 7 


dovish Labor Party, mobilization of suf- 
sient public support would have been harder, 
# po impossible. Similarly, if the opening up 
#-China had been attempted by Humphrey 
‘than by Nixon, opposition to that policy 
| have been more difficult to overcome. 

analysis has some interesting implica- 


ectrum, {iim 
policies: Wa 
advantagel 
-run necésil 


urt now, WREE: oaa for the question of credibility. In partic- 
n favor OMS siar: we show that the credibility of a policy 
>d the COME pends on the ideological identity of the pol- 
Je to check Sey maker proposing it, as well as on the policy 
cet-onlena Feoposes. This is related, although not iden- 
wing goya al,'te the distinction between the credibility 
ad for suck olicles and the credibility of policy makers 


by Drazen and Masson (1994). In both 
Peters, the notion of credibility employed de- 
Upon the policy-policy maker pair. But 
fe also important differences. In Drazen 
On credibility refers to the likelihood 
ai nnounced policy will be carried out 
Aryen aan), within a framework in 
at y IS uncertainty about the polic 
onditions; z: $ type and about Dan Dod that 
ter his ex Post preferred policy. Here 
uncertainty about the policy of the 
ie Since he is committed to the an- 
= oa Hence, our concept of credi- 
"iot to the incumbent’s ability to 
f reel ers that the policy he will imple- 
tected, is better from the point of 


‘Drazen (1996) for a more general treatment. 
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view of the majority than the challenging 
party’s policy. In both cases there is uncer- 
tainty about the policy that will, ultimately, be 
implemented. In our case the uncertainty is 
due to political competition in conjunction 
with electoral uncertainty, while in Drazen and 
Masson it is due to uncertainty about the pol- 
icy maker’s type in conjunction with uncer- 
tainty about economic shocks. Each notion of 
credibility is natural whitin the framework un- 
der consideration. Combining the two frame- 
works may yield more general insights about 
basic institutional determinants of credibility. 
We leave this task for future work. 


APPENDIX 
Note: This Appendix focuses on the case 
a? < o%, which is necessary for policy 


reversals. 


1. Conditions on Exogenous Parameters for 
Assumption I 


Assumption 1 requires that 
(AI) x, =B, + Hy + &) 


< cr + O(y + EL) = XR. 


Using (21) we can rearrange (A1) as: 


(y + £1) 
(a2 + o3) 


o? g? 
a Ng 


(a2 — o?) 


The left-hand side of (A2) is a standard nor- 
mal. Therefore, the condition is satisfied with 
very high probability if cg is sufficiently larger 


(A2) 


< 


‘than ¢’ and/or the (positive) differetice (0% — 


gŻ) is sufficiently small. (It is also more likely 
to be satisfied, the smaller the value of h.) 


2. Taking Care of Corners 


Note: For the sake of brevity, we work di- 
rectly with the symmetric case, as in Section H. 
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The analysis in the text was undertaken 
under the implicit assumption that the prob- 
ability of reelection described by equation 
(13) belongs to (0, 1). For the incumbent L, 
this means that 


A,+ dx 
C 





(A3) ~ PL) = €(0,1). 


Let u = y + e. From the policy function 
xı = By, + ‘hm, it is easy to see that (A3) 
requires 


Mr E (ub HL), 


where pi? = —2(A/d + B,) and ul = 2[(47 — A) 
d — B]. (The corresponding values of x, are 
x? = —A/d and xl = (4€ — Ay.) 

In equilibrium, the probability of reelection 
P(x) can be thought of as the stochastic ag- 
gregation of the median voter’s best response 
to the policy function x,(,). This policy func- 
tion is, in turn, the best response to P“(x). We 
have established in Section I that P(x) = 
(A + axW4e and Xi (pr) = BL + our are best 
responses to each other, which is true only for 
uL E€ (ut, ui). Now we proceed to describe 
behavior outside that range. 

G) When u, < p}, P(x) = 0 and x(u) = 
B, + 'hu, are best responses to each other. For 
x, < —A/d, all potential median voters prefer 
to vote for the challenger R, so that P(x) = 0. 
Therefore, the incumbent faces the decision of 
either definitely losing the election, or com- 
mitting to a policy x, > —A/d. The first course 
of action gives a payoff of —(cr — c1) = —2cp 
and the second, an expected payoff of 





(GS 


de Jon — XxX, +c, +u 





It can be shown that [h — x, + c + ug < 
—2cp for x, > —A/d and u, < yp}, implying 
that the first course of action is preferred. This 
is demonstrated by showing that this inequal- 
ity is satisfied for x, = —A/d and u, = py? and 


_ This means that any policy function x, 


-6). For the sake of brevity, assume that 





















by noting that it is a fortiori satisfied f 
values of x, and lower values of u, 

Hence, when pz, < pp, the incumbent ches 
a policy that gives P(x) = 0 since, in gas 
attain P(x) > 0 he would have to com 
policy too far from his (ex post) prefeneg 


gives x, < —A/d Vu, < pt is consisten 
P(x) = 0 Vx < —A/d. In particular, xis 
B, + 'hu is one such function. 

(ii) When u, > pj, the previous 
function induces an x, so large that (A 
4c > 1, which means that all potential 
voters would vote for the incumbent ( 
P'(x), properly defined, equals 1), Į 
range, the electoral effect on policy be 
irrelevant, since the incumbent is going tof 
reelected with certainty. Hence, the incy 
party adopts its most preferred policy, xy 
H If the public knows that x, = c, + jy: 
forms expectations according to y* = Oy 
c,). This expectation function leads to ths 
cumbent L being reelected with probable 
one for x, = 2c/(1 — 0) — cpr or, equivalent 
u > 221 — 8). Notice that, depending oat 
values of the underlying parameters, yu} cal 
smaller than, larger than, or equal to 2a 


eee 


2e/(1 — 0). In that case, {P“(x) = (A + die. 
Xr (ur) = B, + uL} for HL I MA {PaE 
xı(H) = Cı + pr} for p, > pi, and boths 
best-response pairs for u, € (2c/(1 - OA 
If, as a refinement, we require continu} tg 
the P+(x) function, the full characterizatiee 
equilibrium is: E 


B, + bu for pp = M- g 
x(u) = 


CL + uL for EL >H; 


iP 





and 
0. for x = -Ald - 
ATU ONY ey 
Pj= 
< (47 — Ald 
1 for x = (40-4 
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as (A — dx)exp E J 


= (Á + doexp| - a ; 


É Taking the logarithms (a monotonic transfor- 
Salon) of both sides of the inequality, we ar- 


B 
as y* + In(A — dx) 
z— s, + 1n(A + dx), 


F OF PROPOSITION 1: 
tice that Panels C and D of Figure 2 


for the curves to be as presented, it 

"ai that the interior regions (x2, xz) 

a 35) overlap, or x} = (42 — Ayd > (A - 
R» Which is'to‘say 4@>A2! 0 U 


the tailed drawing of Figure 2, it matters 
than, equal to, or greater than x2, the 
0. The figure is drawn with 
2¢ > A, which is stronger than the 


We now proceed to characterize F(x). Note. 
that F(0) = 0, that F(x) > © as x > x? from 
above, and that F(x) > 
below. Hence, we know from continuity, that . 
F(x) = 0 at least three times, at x = x € (x, 
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0), at x = 0, and at x = x € (0,.x9). 


We can verify that these are indeed the three 


crossings by analyzing 


at ees 


Notice that F’(x) = 0 at 


(A4) 


Since Vd < AB, F’(x) 
roots, one positive and one negative, which are 


d d 
(A — dx) (A+ dò’ 
1 [vd 
d AB” 
= 0 has exactly two real 


equidistant from zero. 


Also, 


F(x) = — 


mission in Debate.”’ 


{| 


It is easy to see that F’(x) < 0 for x > 0 and 
that F(x) > 0 for x < 0. This implies that the 
negative root of (A4) corresponds to a mini- 
mum of F(x), and that the positive root of (A4) 
corresponds to a maximum of F(x). Hence, in 
the range (x, xR), the slope of F(x) switches 
Signs three times. Since F(x) is positive near 
the lower end of this range, negative near its 
upper end, and 0 at x = 0, it follows that F(x) 
has exactly three roots and that Figure 2, Panel 
D, correctly represents the curve. Since the fig- 
ure and the proposition are equivalent, this 
completes the proof. 
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The Efficiency-Enhancing Effect of Auctioning the Right to Pi 
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What Price Coordination? 


By VINCENT CRAWFORD AND BRUNO BROSETA * 


« + 


3 awe me mene 
Ss s 


A model is proposed to explain the results of recent experiments in which subjects 
repeatedly played a coordination game, with the right to play auctioned each 
period in a larger group. Subjects bid the market-clearing price to a level re- ` 
coverable only in the efficient equilibrium and then converged to that equilibrium, 

although subjects playing the game without auctions converged to inefficient 
equilibria. The efficiency-enhancing effect of auctions is reminiscent of forward i 
induction, but is not explained by equilibrium refinements. The model explains it 
by showing how strategic uncertainty interacts with history-dependent learning ` 


dynamics to determine equilibrium selection. (JEL C73, C92, C51) 


Coordination is central to many questions in 
economics, from the determination of bargain- 
ing outcomes to the design of incentive 
schemes, the efficacy of implicit contracts, and 
the influence of expectations in macroeconom- 
ics. Such questions are usually modeled as 
noncooperative games with multiple Nash 
equilibria, and analyzed under the assumption 
that players can realize any desired equilib- 
rium. Yet this begs the questions of whether, 
and how, coordination comes about and 
how the environment influences equilibrium 
selection—questions that lie at the heart of 
most applications. 

Although recent advances in game theory 
have added much to our understanding of co- 


* Crawford: Department of Economics, University of 
California, San Diego, 9500 Gilman Drive, La Jolla, CA 
92093; Broseta: Department of Economics, University of 
Arizona, McClelland Hall, Room 401, Tucson, AZ 85721. 
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Costa Gomes, Yong-Gwan Kim, Mark Machina, Daniel 
McFadden, Ronald Oaxaca, Alvin Roth, Jason Shachat, 
Joel Sobel, Glenn Sueyoshi, John Van Huyck, and two 
referees for helpful comments; Miguel Costa Gomes and 
Mahesh Johari for research assistance; Raymond Battalio, 
Richard Beil, Gerard Cachon, Coliti Cafnerer, and John 
Van Huyck for sharing their experimental data: Raymond 
Battalio, John Van Huyck, and Roberto Weber for con- 
ducting additional experiments for us; and the National 
Science Foundation, the Centro de Formación del Banco 
de Espafia, and the College of Business and Public Ad- 
ministration, University of Arizona, for research support. 


called efforts, and an order statistic of all 
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ordination, there remains a large gap be af 
theory and experience that is unlikely tok $9 
closed by theory alone. Further progress 


pirical evidence on strategic behavior. 
iments are a particularly useful source o 
evidence, in part because they make it po 
to observe the entire coordination pr 
Crawford (1997) surveys a number of ret 
studies in which subjects repeatedly played 
ordination games, uncertain only about 
other’s strategy choices. The typical result 3 
convergence to an equilibrium, often WiRt 
systematic pattern of equilibrium selecti 
the limit. Explaining such patterns promis 
shed considerable light on coordination, $ 
field as well as the laboratory. 
In this paper we seek to explain some, 
most intriguing evidence on coordinato 
have seen, from the experiment of Johi. yi 
Huyck et al. (henceforth ‘“VHBB”’) 
Their subjects repeatedly played a mn 
coordination game with seven sy 


best responses determined by their own & 


subjects’ efforts, in this case the me 


der statistic was the minimum. 












































= instead of endowing nine subjects 
fight to play the game as in their 1991 
nts, WHBB auctioned its nine posi- 
each period in a group of 18. The structure 
icly announced at the start; the market- 
g price and median effort were publicly 
-ed each period; and explicit commu- 
f pe gee rohibited throughout. 
ect a aos (1993) design is of unusual eco- 
s--interest. Its 1991 precursor, in which 
jects played the same median coordi- 
asion game without auctions, is among the 
st models of the emergence of conven- 
solve coordination problems, with a 
f possible outcomes and a natural mea- 
ene of their efficiency. The median game is 
milar in structure to several important mod- 
2gfrom the Stag Hunt example Rousseau 
to motivate his analysis of the social con- 
Fact: to Keynes’s beauty contest analogy and 
$z inore prosaic macroeconomic coordination 
godels surveyed in Russell Cooper and 
3 w John (1988). Its efficient equilibrium 
sw plainly the “‘correct’’ coordinating princi- 
, but it is best for a subject to play his part 
Ff that equilibrium only if he thinks it likely 
gy ai enough other subjects will do so. As in the 
Baker coordination games VHBB (1990, 
991) studied, there is a tension between the 


rium, ` 
icient ` 
ward 
zins it 
rning 


> gap bel 
inlikely't 
rogress seug 
ieory wit 
havior. E 


mber offe 


1y abon J Sgher payoff of the efficient equilibrium and 
1. often Wie £0: greater robustness of other equilibria to 
at selectiOeg is uncertainty about each other’s re- 
ms promi i: Sp-which we call strategic uncertainty. 
dination ly, ‘auctioning the right to play is an in- 


£ form of preplay communication, in 
‘Subjects’ willingness to pay may signal 
ey expect to play, and thereby 
ite the tension due to strategic uncer- 

€ auctions also capture important as- 
of “general equilibrium” analogs of 
8 earlier environments, in which play- 
S00se among coordination games with the 
-clearing price analogous to the op- 
own gas: oe determined by their best 
ction is an unusual form of preplay commu- 
rhea messages can directly influence 
à ce are not ‘‘cheap talk’’; and they are 
=e only through an aggregate, the market- 
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` Auctioning the right to play the median 
game had remarkable consequences. When - 
that game was played without auctions in 
VHBB’s (1991) experiment, most subjects 
initially chose inefficiently low efforts, and six 
out of six subject groups converged to ineffi- 
cient equilibria. Auctions might be expeeted 
to yield more efficient outcomes simply be- 
cause subjects have diverse beliefs about each 
other’s efforts, auctions select the most opti- 
mistic subjects to play, and optimism favors 
efficiency in the median game. But VHBB’s 
(1993) subjects did much better than this ar- ` 
gument suggests: In eight out of-eight groups, 
they bid the market-clearing price to a level 
recoverable only in the efficient equilibrium 
and then converged to that equilibrium.’ Con- 
vergence was very fast, essentially complete 
within three to five periods. The limiting out- 
come was consistent with subgame-perfect 
equilibrium in the stage game consisting of the 
auction followed by the coordination game, 
and subjects’ behavior suggests that their be- 
liefs were focused as in the intuition for for- 
ward induction refinements, in which players 
infer from their partners’ willingness to pay to 
play a game that they expect their payoffs to 
repay their costs, and intend to play accord- 
ingly (Elchanen Ben-Porath and Eddie Dekel, 
1992).4 

The efficiency-enhancing effect of auction- 
ing the right to play in VHBB’s (1993) ex- 
periment suggests a novel and potentially 
important way in which competition might 
promote efficiency. Although it conveys a 
powerful impression by itself, it raises as many 
questions as it answers. Was the outcome 


* Roberto Weber (1994) has replicated this result in a 
closely related environment, and Gerard Cachon and Colin 
Camerer (1996) have verified its robustness and refined its 
interpretation. Their experimental designs and results, and 
the extension of our analysis to explain them, are dis- 
cussed in Crawford and Broseta (1995). 

4 The intuition for forward induction seems to favor the 
efficient équilibritiim, arid subjects’ limiting behavior was 
consistent with forward induction, as it is usually formal- 
ized, as well as subgame perfectness. But both refinements 
are consistent with any of the seven symmetric pure- 
strategy equilibria in the coordination game (with full sur- 
plus extraction in the auction), so neither helps to explain 
VHBB’s result. f 
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VHBB observed inevitable in their environ- 
ment? How would the strength of the effect 
vary with the treatment variables —the 
number of players in the auction and the co- 
ordination game and the order statistic that de- 
` termines the robustness of the efficient 
equilibrium — which never varied in the 


experiment? Would the effect exténd t6’envi- ~ 


ronments beyond those that directly generalize 
VHBB’s designs? 

In principle such questions could be an- 
swered piecemeal by further experiments, but 
only a theory that elucidates the mechanism 
behind VHBB’s result can provide a firm basis 
for generalization and realize its full power to 
inform analysis. Such a theory can change the 
way we think about many applications in 
which the participants in coordination games 
are determined by a sorting process like 
VHBB’s auction or its general equilibrium 
analogs. 

The models in Cooper and John (1988), for 
instance, view the entire economy as a coor- 
dination game, but it may be more realistic to 
view it as composed of sectors, regions, or 
firms, each of which is a coordination game. 
Because participants must often choose among 
these, the economy may be closer to VHBB’s 
auction environment than the games with ex- 
ogenous participation of their earlier experi- 
ments. If so, VHBB’s result suggests that 
modeling the economy as a single coordina- 
tion game systematically overstates the power 
of coordination failure to explain underem- 
ployment. Alternatively, consider a coordina- 
tion model of Hong Kong’s economy that 
takes into account the high costs most inhab- 
itants paid to get there.” Did Hong Kong do so 
well by attracting entrants who expected effi- 
cient coordination, as in our ‘‘optimistic sub- 
jects” effect; via the power of entry barriers 
to focus beliefs on better coordination out- 
comes, as in our ‘‘forward induction’’ effect; 
by having rules that favor efficient coordina- 
tion, perhaps as in the “‘robustness’’ effect dis- 
cussed below; or simply by exploiting its 


5 We are grateful to a co-editor for suggesting this il- 
lustration, which is closer to VHBB’s auction environment 
because Hong Kong’s position has been nearly unique. 
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` mined their payoffs and best responses, | af 




































natural advantages and ‘‘skimmj 
cream’’ from mainland China? Only a4 
that distinguishes and quantifies th 
fects can give convincing answers to 
questions. 

In this paper we propose a theory to ex, h 
VHBB’s (1993) result, generalizing our 
yses of VHBB’s (1990, 1991) resul 


to a class of coordination games with ances iat 
in which VHBB’s treatment variables cay 
arbitrary values.° The main difficulty our i 
ysis must address is explaining equilibriu 
lection; the rapid convergence to equilj 
in the stage game in VHBB’s (1993) ¢ 


In VHBB’s (1990, 1991) experimen $? 
dynamics and limiting outcomes varied jy 


tematically with the size of the groups plays 
the game and the order statistic that ¢ 


strong downward drift and convergence tate 
equilibrium with lowest effort in VHBE} 
(1990) large-group minimum games, bitk 
drift and consistent ‘‘lock-in’’ on the ink 
median in VHBB’s (1991) median g 
These and other variations in equilibrium 
lection across VHBB’s treatments disci 
nate sharply among alternative theories 
strategic behavior, both traditional id 
adaptive. 
Our analyses explained these variatio 
persistent effects of interactions between 
tegic uncertainty and history-dependent leat 
ing dynamics. One would expect the le 
strategic uncertainty to decline grad Ad 
zero as players learn to forecast each 0 
responses. Unless this decline is very slo 
learning dynamics lock in on a particulat 
librium in the limit, and the model’s 7 
cations for equilibrium selection ¢ 
summarized by the prior probability dist 
tion of that equilibrium, which is 00 nit 
nondegenerate due to the persistent € 
strategic uncertainty. The distribution 


ing 
£ Yong-Gwan Kim (1996) suggests interest of 
native explanations of VHBB’s result, based 08 k 
dynamics and axioms in the spirit of evolutionary 
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-nilibrium is determined by players’ 
ug a treatment variables, and the 
‘ve effects of strategic uncertainty on 
smamics. The key to our explanation is 
strategic uncertainty generates what we 
saj] a robustness effect, which gives the 
dynamics a negative, zero, or positive 
or order statistics respectively below, at, 
e.the median. The magnitude of the 
increases with the level of strategic un- 
„ and for sufficiently large initial lev- 
déc ining over time, it makes the prior 
sility distribution of the limiting equilib- 
across VHBB’s (1990, 1991) treat- 
aisgmuch as its empirical frequency 
ribution varied in their experiments. 
‘Were we adapt our earlier methods to show 
i VHBB’s (1993) environment, strategic 
gecertainty generates robustness, optimistic 
‘gabiects, and forward induction effects large 
ieħ to explain the pattern of equilibrium 
letion they observed. Our approach yields 
tnified explanation of VHBB’s (1990, 1991, 
4993) results, and makes it possible to assess 
likely importance of those effects in other 
onments. 
€‘assume throughout that players are ra- 
anal in the standard sense that their decisions 
‘Baxlmize expected payoffs, given their beliefs 
uit each other’s decisions. But our analysis 
tive in that, rather than assuming equi- 
üm in the stage game or the repeated game 
scribes players’ entire interaction, we 
aplayers to have diverse beliefs about 
er’s decisions in the stage game and 
e process by which their beliefs and 
HS=converge as they learn to predict 
er’s decisions. Thus, our analysis al- 
quilibrium to be reached by unsophis- 
eaming from actual observation rather 
ctive, strategically sophisticated 


Mapose the structure needed for analysis 

Special features of VHBB’s environ- 
sive a simple econometric character- 
of Strategic uncertainty and the 
dynamics, which allows a more in- 
analysis of the effects of strategic 

than now seems possible for 
eneral. Players’ initial beliefs and 
new observations each period are 


volutionaly, 
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perturbed by independently and identically 


- distributed (henceforth, ‘‘1.i.d.’’) random 


shocks. The shocks represent strategic uncer- 
tainty in terms of the differences in players’ 
learning rules, and their variances represent 
the initial level of strategic uncertainty and 
how it varies over time as plaYers léarn to fore- 
cast each other’s responses. In effect each 
player has his own theory of coordination, 
which gives his initial beliefs and his interpre- 
tations of new observations an unpredictable 
component. 

The resulting model is a Markov process 
with time-varying transition probabilities, the © 
dynamics of which are driven by strategic un- 
certainty. Like our earlier models, it encom- 
passes the leading alternative characterizations 
of strategic behavior in such settings: tradi- 
tional analyses in which players’ beliefs and 
decisions are focused from the start on a par- 
ticular equilibrium, evolutionary analyses of 
the ‘‘long-run equilibria’’ of ergodic dynamics 
with small amounts of noise, and history- 
dependent learning with lock-in on a particular 
equilibrium in the limit. These are distin- 
guished mainly by different values of the var- 
iances that represent the level of strategic 
uncertainty and how it varies over time. 

As in our earlier analyses, we take the vari- 
ances and certain other aspects of behavior to 
be exogenous behavioral parameters. Our 
characterization provides a framework within 
which to close the model by estimating them, 
using VHBB’s (1993) experimental data.’ As 
before, the estimated parameters generally sat- 
isfy the restrictions suggested by theory, but 
differ significantly from the values needed to 
justify an equilibrium analysis or an analysis 
of the long-run equilibria of ergodic dynamics. 
Instead they indicate large initial levels of stra- 
tegic uncertainty, declining to zero over time. 
The decline is rapid enough to make the 


7 The need to proceed this way will come as no surprise 
to anyone who accepts Thomas Schelling’s (1960) prem- 
ise that the analysis of coordination is inherently partly 
empirical. Strategic uncertainty is crucial to understanding 
the dynamics, but the differences in subjects’ beliefs can- 
not be reliably explained by theory alone because sub- 
jects were indistinguishable and had nearly identical 
information. 
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dynamics converge with probability one to one 

of the pure-strategy subgame-perfect equilib- 
ria of the stage game, so that the model’s im- 
plications for equilibrium selection can be 
summarized by the prior probability distribu- 
tion of that equilibrium. The estimated model 
gives an adequate statistical summary of in- 
dividual subjects’ behavior, and implies prob- 
ability distributions of the dynamics and 
limiting outcome that resemble their em- 
pirical frequency distributions in VHBB’s 
experiment. 

We study equilibrium selection in more de- 
tail by using special features of our learning 
model and the environment to obtain a closed- 
form solution for the histories of players’ be- 
liefs, bids, and efforts as functions of the 
behavioral parameters, the treatment variables, 
and the shocks that represent strategic uncer- 
tainty. The solution shows how the limiting 
outcome is built up period by period from the 
shocks, the effects of which persist indefi- 
nitely. This persistence makes the learning 
process resemble a random walk in the aggre- 
gate, but with declining variances and nonzero 
drift that depend on the parameters and treat- 
ment variables. As in our earlier analyses, 
large, persistent effects of strategic uncertainty 
make it necessary to analyze the entire learn- 
ing process to understand equilibrium selec- 
tion, and preclude explanations based on 
equilibrium refinements or ergodic dynamics, 
which assume away either strategic uncer- 
tainty or persistence. 

The form of our solution indicates that un- 
less the behavioral parameters vary sharply 
and unpredictably with changes in the envi- 
ronment—which our estimates for VHBB’s 
(1990, 1991 )experiments suggestisunlikely — 
the dynamics and limiting outcome will vary 
with the treatment variables in stable, predict- 
able ways. We begin with a qualitative com- 
parative dynamics analysis of the effects of 
changes in VHBB’s (1993) treatment vari- 
ables. Generalizing results from our earlier 
analyses, we make precise, in the probabilistic 
` sense appropriate to the model, the common 
intuitions that coordination tends to be less ef- 
ficient the closer the order statistic is to the 
minimum (our robustness effect) and less ef- 
ficient in larger groups because it requires 





































coherence among a larger number of jp, 
dent decisions. We also establish a new iY fA 
showing that coordination tends to be mgs = 
ficient the more intense the competition 24) 
right to play. mae 
Our model explains the efficig, 
enhancing effect of auctions as the Tesulj 
dynamic interplay between forward indy. 
optimistic subjects, and robustness e 
Our solution allows a quantitative comp 
dynamics analysis that shows how the; 
nitudes of these effects are determined by 
treatment variables and behavioral parame 
and makes it possible to estimate their im 
tance in other environments. The mean 
dination outcome can be expressed as thé; 
of four components per period, one of 
is the forward induction effect, one of wks? 
combines the optimistic subjects effec 
the robustness effect of our earlier analyy$ 
and two of which are smaller effects, disci k; 
below. These components can be appi pas 
mated as known functions of the treatin}; 
variables, the behavioral parameters, andtă $} 
ulated statistical parameters. : 
These approximations yield a simple 
acterization of the optimistic subjects $; 
robustness effects. Together they have appr: 
imately the same magnitude in VHBBifs 
environment (where the right to play a 
person median game was auctioned ina 
of 18) as the robustness effect in an 18- 
coordination game without auctions in whe 
payoffs and best responses are determin 
the fifth highest (the median of the nine hig: 
est) of all 18 players’ efforts. In this respee 
the auctions effectively transformed VHB: $ 
median game, which without auctions was, 
have a robustness effect that contributes! 
drift to the dynamics, into a 75th-p 
game (0.75 = 13.5/18) with a robustes 
fect that contributes a large upward drift. y 
estimates suggest that this drift is respo% Æ 
for roughly half of the efficiency-enhatt® 
effect of auctions in VHBB’s environ 
and that the other half is due to a strong, 
ward induction effect. i 
Our quantitative analysis also sho 
unless the behavioral parameters vary 
with changes in the environment, the 
positive efficiency-enhancing effect 





























ihe class of environments we study, but it 
always strong enough to assure fully ef- 
t coordination. More generally, the 
ism suggests that the effect will extend 
“ether laboratory Or field environments that 
“mbine significant strategic uncertainty with 
‘he ingredients of our optimistic subjects and 
Z ward induction effects, 
<The rest of the paper is organized as follows. 
«tion I describes the environments we study 
and introduces our learning model. Sections II 
“gad IHI present the theoretical and econometric 


gaalyses. Section IV is the conclusion. 
I. The Model 


In this section we describe the class of envi- 
snaments we study and introduce the learning 
i model that is the basis for our theoretical and 
: esopometric analyses. We call the highest price 
a which a player plans to remain in the auction 
his bid, and we define efficiency with reference 
to players’ payoffs in the coordination game, 
Bearing in mind that the auction may transfer 
. Sime or all of the surplus to the experimenters. 


A. The Environment 


ia VHBB’s (1993) experiment subjects 
‘played the nine-person median coordination 
€é.0f VHBB’s (1991) treatment T for 10 

3-periods, with the right to play auctioned 
:petiod in a population of 18 subjects. In 
edian game nine players choose-among 
fforts, (1, ..., 7}, with payoffs deter- 


In this f d by their own efforts and the group me- - 
yrmed VHB ort. Denoting players’ efforts at time t 
auctio 3] “s 4, and the median by M,, player i’s 
ontributes payoff in dollars is 0.1 M, — 0.05(M, — 
75th-p .6. Because no player’s effort can in- 
1 robustil the median when all other players are 


a h i iguration of efforts with £, = M, for 
ef S an equilibrium, and these seven sym- 


a. These equilibria are strict and 
ed, with equilibria with higher M, 


also she all pl 
ers vary Players than those with lower M,. 
nent, thay On was a multiple-unit ascending- 


Clock as in Kevin McCabe et al. 


€quilibria are the only pure-strategy ` 
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(1990), in which subjects indicated their will- 


_ingness to pay the current asking price by 


Fy 


holding up bid cards, with subjects who 
dropped out not allowed to reenter the bidding. 
The asking price started five cents below the 
payoff of the least efficient equilibrium of the 
median game and increased by five-cent incre- 
ments until 11 or fewer subjects remained, af- 
ter which it increased by one-cent increments 
until nine or fewer subjects remained. (The 
payoffs of adjacent equilibria differed by ten 
cents.) The market-clearing price was deter- 
mined as follows. If the lowest price at which 
nine or fewer subjects remained in the auction 
left exactly nine subjects, all nine were 
awarded the right to play at that price. If that 
price left fewer than nine subjects, they were 
all awarded the right to play, with the remain- 
der of the nine slots filled randomly from those 
who dropped out at the last increase, and all 
nine subjects paying the price before the last 
increase® 

In VHBB’s (1993) design, as in their treat- 
ment T, explicit communication was prohib- 
ited throughout; the median was publicly 
announced after each play; and with minor ex- 
ceptions the structure was publicly announced 
at the start. The market-clearing price was 
publicly announced after each auction, before 
the winners played the median game. 

We now describe a class of environments 
that generalizes VHBB’s (1993) design by al- 
lowing any values of their treatment variables 
and any coordination game that shares the 
structural features noted above (including 
those used in VHBB’s [1990, 1991] other 
treatments). We assume throughout that bids 
are continuously variable.* We introduce the 
model under the simplifying assumption that 
effort is also continuously variable. In Section 
I, subsection D, and more formally in Section 
MI, subsection B, we explain how to adapt the 


_. = Thus subjects never paid more than they indicated 
they were willing to, and théy were never excluded in- 
voluntarily unless they had indicated approximate indifference. 

° This was not literally true in the experiments, but the 
increments by which bids could vary were smail enough 
(one cent near the market-clearing price, compared to a 
ten-cent payoff difference between equilibria) to make it 
a good approximation. 
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model to the discrete effort spaces of VHBB’s 
experiment by viewing the continuously vari- 
able efforts as the latent variables in an ordered 
probit model of discrete effort choice. 

There is a finite number, m, of indistin- 
guishable players, who repeatedly play a 
stage game with symmetric player roles. 
The stage game consists of an n-person 
coordination game, with n < m, preceded by 
an auction in which all m players bid for the 
right to play. ` 

The coordination game has one-dimensional 
strategy spaces, with strategies called efforts. 
Effort has a commonly understood scale, 
which makes it meaningful to say that a player 
chose the same effort in different periods, or 
that different players chose the same effort. 
Any symmetric effort combination is an equi- 
librium, and such combinations are the only 
pure-strategy equilibria. By symmetry, these 
equilibria are Pareto ranked unless players are 
indifferent between them. Each player’s best 
responses are given by a summary statistic of 
all players’ efforts when that statistic is unaf- 
fected by his effort. The summary statistic, y,, 
is a function f ( Xir, ... , Xar), Where x; is player 
?s continuously variable effort at time ¢ and 


players 1, ... , play the game. We assume that 
fC) is continuous and, for any Xir, ... , Xn and 
constants a and b = 0, f(a + bx, .., a + 


DXn) = a + Df (X11, «5 Xm). These assump- 
tions are satisfied when f(-) is an order statis- 
_ tic or a convex combination of order statistics 
such as the arithmetic mean. To see what they 
entail, note that the symmetry of roles implies 
that f(-) is a symmetric function of the x; so 
that its value is determined by the order statis- 
tics of their empirical distribution. Our as- 
sumptions rule out most nonlinear functions of 
these order statistics, which is restrictive but 
probably not unrepresentative of symmetric 
games. We abuse terminology by calling f (C) 
an order statistic. 

The auction is a multiple-unit ascending-bid 
English clock as described above, with player 
i’s bid and the market-clearing price at time t 
denoted p; and qn» respectively. We depart 
_ from the auction VHBB used only in assuming 
that p, and q, are continuously variable. With 
continuously variable bids the effects of 
VHBB’s tie-breaking procedure are negligi- 












ble, and q, can be taken to be the (n 
largest of the p,, an order statistic of the" 
pirical distribution. For any given m and 

can be written as a function, 


(1) qı = 8(Pin sas Pmi), 


‘where for any py,, ... 


> Pm and any COnstan: 
and b = 0, aiy 


(2) — g(a + bpy, ..., a + bpm) 


a+ bg (Pin eee >» Pmt). 


Because the form of g(-) is completely di: 
mined by m and n, we describe chan 
g(-) below by describing the assoc 
changes in m and n, without separate ref 
to g(-). 

The outcome of the stage game can now 
described more precisely. We write 
(3) yy = ACK, « 


»Xmrs Pin + , Pmi), 


where the x, now include planned efiă $% 
choices for all m players, and A(x, „u, KÈ 
Pits +5 Pme) equals f(-) evaluated at the Ee: 
sociated with the n highest p;, in the popie 
tion— noting that, the order of these 4# 8i 
immaterial and ignoring ties in the py, Wit 
will have probability zero under our disttté 
tional assumptions. It is immediate from 
definition that for any Pir, ... , Dm and any 
stants a and b = 0, h(-) is continuous 

> Xt and inherits the scaling prm 
FC ), in that 


(4) h(a t+ bx, 0, A + DX} Pin = 


(5) A(x, -. 


= h Xirs o3 Xnr Piss oo 


We assume that the structure of the en’, 
ment is publicly known. 
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B. Equilibrium Refinements 
in the Stage Game 


oted above, in VHBB’s (1993) exper- 
t out of eight subject AO at the 
ing price of the right to play to a 
saunas only in the efficient equilib- 
and their efforts converged to that equi- 
mm. AS VHBB’s (1993) Tables V and VI 
lear, subjects’ bids and efforts were 
é y consistent with the intuition for for- 
duction, in that individual subjects sel- 
‘aam bid more than their efforts made it 
nassible to recover in the coordination game, 
“gid ever did so after the first few periods. 
‘Although we shall argue below that the lim- 
inge tcomes in the experiment cannot be un- 
00d without analyzing history-dependent 
g dynamics, it is instructive to consider 
ge implications of equilibrium refinements in 
‘tha stage game. Because the structure was pub- 
‘Gely;announced, we assume complete infor- 
mation, The most relevant refinements are 
subgame perfectness and forward induction, 
“gad the essential points can be seen by focus- 
ise. of symmetric pure-strategy equilibria. 
“Wukout loss of generality we rescale the pay- 
‘sifa In the coordination game, which are in- 
gesing in players’ common efforts, so that in 
“lof these equilibria players’ payoffs equal 
efforts. 
en all players use the same bidding strat- 
Sey andn < m, no player can alter the market- 
4 ig price by changing his bid. It is easy 
eck that any strategy combination with 
4 = q, = y, for all i, in which each player 
# his payoff in a symmetric pure-strategy 
librium of the coordination game and then 
at equilibrium’s effort unless he is the 
“Player to bid that amount, is a subgame- 
“t equilibrium of the stage game.’° 


layer can gain by changing his bid because the 
8 play yields zero surplus. Other symmetric pure- 
e-perfect equilibria differ from these only 
ea in which the right to play yields positive 
a player could gain by outbidding his part- 

ng self the right to play without raising the 
ng price. Kim (1996 Lemma 1) gives a full 

r thie of the pure-strategy subgame-perfect 
game, including asymmetric ones. 


j» off the equilibrium path. There are no sym- | 
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In applying forward induction we follow 
Ben-Porath and Dekel (1992) in identifying it 
with the ability to survive iterated deletion of 
weakly dominated strategies. VHBB’s stage 
game resembles Ben-Porath and Dekel’s Fig- 
ure 2.3b example, in which two players pub- 
licly and simultaneously decide whether to 
burn a given quantity of money before playing ` 
a coordination game like VHBB’s.'' This 
game has a symmetric subgame-perfect equi- 
librium in which players burn money and then 
play the inferior equilibrium in the coordina- 
tion game, each anticipating that equilibrium 
unless he is the only player who burns money. 
In this equilibrium each player’s strategy is a 
unique best response given his beliefs, so it 
‘survives iterated deletion of-dominated strat- 
egies. As Ben-Porath and Dekel note, such 
equilibria normally exist whenever players 
move simultaneously in the communication 
phase. In VHBB’s stage game such an equi- 
librium can be constructed to support any of 
the seven symmetric pure-strategy equilibria 
in their coordination game, with full surplus 
extraction in the auction. The requirements for 
such equilibria are also consistent with sub- 
game perfectness. Thus, subgame perfectness 
and forward induction in the stage game, to- 
gether or separately, are consistent with the 
limiting outcomes in VHBB’s experiment but 
too unrestrictive to help in explaining them. 
We use ‘‘forward induction’’ loosely below to 
refer to the intuition rather than a formal 
definition. 


C. The Learning Model 


For our purposes, it is essential to capture 
the idea that, even if players form their beliefs 
and choose their bids and efforts sensibly, they 
may differ in unpredictable ways. It is also es- 
sential to describe the dynamics of beliefs, 


1! VHBB’s stage game differs from Ben-Porath and 
Dekel’s example in that the latter’s players must play the 
game whether or not they burn money, and must beaf the 
cost of any money they burn no matter what their partners 
do; but these differences are inessential here. Note that 
unilateral deviations from symmetric bid combinations are . 
always observable, although deviations from asymmetric 
combinations might not be. 
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bids, and ‘efforts realistically in terms of ob- 
servable variables, in a way that permits esti- 
mation of the behavioral parameters that 
cannot reliably be determined by theory (such 
as the variances that represent the level of stra- 
tegic uncertainty) and allows an informative 
theoretical analysis. 

- In the environments we study, players’ bids 
and efforts evolve as follows. First all m play- 
ers choose their initial bids, the resulting value 
of q, is publicly announced, and the n winners 
choose their efforts in the coordination game." 
All m players then observe the resulting value 
of y, and choose new bids, and the process con- 
tinues. Because the structure is public knowl- 
edge, players face uncertainty only about each 
other’s bids and efforts. The effects of others’ 
bids and efforts on a player’s payoffs are fil- 
tered through q, and y,. 

The large subject populations in VHBB’s 
(1993) experiment make it a plausible work- 
ing hypothesis that players treat their individ- 
ual influences on q, and y, as negligible. This 
implies that players’ optimal bids and efforts 
each period are determined by their current 
payoff implications, and thus by their beliefs 
about the current y,, because those beliefs de- 
termine their expected payoffs and optimal ef- 
forts, and their bids make winning the right to 
play contingent on g,. We imagine that players 
begin with prior beliefs about the process that 
generates q, and y,, use standard statistical pro- 
cedures to revise their beliefs in response to 
new observations, and choose the bids and ef- 
forts that are optimal, given their beliefs. Play- 
ers whose priors differ may then have different 
beliefs even if they have always observed the 
same history and used the same procedures to 
interpret it. 

We adopt a simple econometric characteri- 
zation of the dynamics of beliefs, bids, and 
efforts in the style of the adaptive control lit- 


'? We assure that bids: are not publicly~ observable, 
even though VHBB’s subjects were in booths that may 
have allowed them, with some effort, to observe when 
other subjects dropped out. At our request Weber (1994) 
ran two trials that replicated VHBB’s design but ensured 
that bids were unobservable, with very similar results. 



























erature.'? The key insight of the contrgj: 
ature is that describing how beliefs reg, 
new information does not require repy 
them as probability distributions or th 
ments. It is enough to model the dynamig; 
the optimal decisions they imply, whi 
the only aspect of beliefs that directly af 
the outcome. We represent players’ bétief 
their optimal efforts, which when contin 
variable preserve enough information to; 
istically describe their bids as well as th Pi 
forts.'* There is no need to assume thag 
optimal efforts are determined in any sity 
way by the moments of the distributing gg. 
describe players’ beliefs. 
We describe the conditional means of j 
ers’ responses to new observations by ; 
linear adjustment rules, in which their belig F i 
adjust part of the way toward the valu f 
gested by the latest observation of y, in ay Yig 
that generalizes the fictitious-play and 
response rules to allow different values o 
rameters that represent the initial levek $ 
trends, and inertia of beliefs. We describe & $f% 
differences in players’ beliefs by perturhy $ 
these mean adjustments by idiosyncratic; 
dom shocks each period, which representi $s 
tegic uncertainty described in terms of E 
differences in players’ learning rules. Wee ¥ 
sume that these shocks are i.i.d. across play 
with zero means and given variances. Ths 
any correlation in players’ beliefs that em 
is the result of responses to common 00s 
vations of y, and q, rather than an artifad# 
our distributional assumptions. 
We distinguish between players’ belé 
when they bid and when the winners 
auction play the coordination game, 
the latter beliefs. may reflect inferences 
qı. Player i’s beliefs in period t before ant. £ 
he observes q, are denoted X, and Xin SF 4 
tively. They are interpreted as his estimat 
his optimal effort given his informat%, 
those times, with X, and x, denoting the 4 
ciated population m-vectors. F 


‘3 


'3 See M. B. Nevel’son and R. Z. Has’ minskit 
or Michael Woodford (1990 Sec. 2). 
14 Our model with discrete efforts, by com"? 
scribes beliefs and efforts separately. 
































t NO. I 
contr players’ jnitial beliefs are given by 
3S Tespgp l 
represe Xio = Ho + io» 
or thei 


dyna "ere mis their common mean and the Cio are 
Y, which ae tic shocks. ee 

ectly-a idnsyn ine that payoffs in the coordination 
ts’ belii ene are measured so that players’ payoffs in 


conting g tric equilibria equal their efforts, play- 


ation to * ids in period f are given by 

Has the an 

ume that} = Z . 
; = : l t=0,... 

n any | Pa = By + AE The 0, ’ 

ributiog X 


the B, are constants and the 7;, are idio- 
“neratic shocks that represent the differences 


ms by ; ia players’ bidding strategies. This specifica- 


1 their aga can be justified as follows. Players’ values 
ne valu f the right to play may be affiliated, because 
of y, in with strategic uncertainty higher bids signal 
lay and ‘Higher beliefs and efforts, which raise the ex 
; values Of ante distributions of all players’ payoffs.'° But 
nitial leve Because players ignore their future influences 
e describe ‘aay, and their individual influences on q, are 
by perturbi . eegligible unless they do not win the right to 
isyncrati¢ ‘glay,standard arguments show that it is ap- 
represent SR proximately optimal for a player to stay in the 

terms Of sictlon until his value given the current level 


falls below q, [although q, influences the 
v Sütcome in the game, via (8) below, this cri- 


riances Uniquely defines his bid when 6, < 1].'° 
fs that em y because VHBB’s median game im- 


layers’ -P ‘ 

winne Milgrom and Robert Weber (1982) define affil- 
game, beam ven though winners play the same game, their 
afer ences is t¢-not common because their beliefs and efforts 

Bin i? 

efore and B's auction is a multiple-unit version of the 
and x; auction studied by Milgrom and Weber 
his es hey show that when players have affiliated values 
inform : gies each other’s bids, it is optimal for a player 
oting > until the current price makes him indifferent 


8 or losing, taking into account the win- 
However, in VHBB’s auctions winning re- 
ng toa player about other winners’ beliefs; and 
-AS aANT eam about losers’ beliefs is irrelevant 
Seifent value of th idi i 

eS one m VUE game unless the idiosyncratic 

Has’ e beliefs are correlated. This suggests that 
; ~eUTSe effects are unimportant here, and that it 

differe 


nce whether players can observe losers’ 
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poses a quadratic payoff penalty for efforts 
away from the median, a risk-neutral player’s 
optimal effort equals the mean of his distri- 
bution of y, by certainty-equivalence. His ex- 
pected payoff is then equal to that effort minus 


the variance of his distribution of y,, which 
_ should be approximately proportional to the 


variance of 6;, in (8) below. This yields (7), 
with 6,, the mean of players’ corrections for 
the cost of strategic uncertainty, negative and 
approaching zero over time if Var 0, — 0. 
After the auction players observe q,, as de- 
termined by (1), and update their beliefs to 


(8) Xi = Yı F 6G: + (1 g Ôr) Xir + bis 


t=0,..., 


where the y, and 6, are constants and the 9,, are 
shocks that represent the differences in updat- 
ing rules. y, and 6,, the level and slope param- 
eters of these linear adjustments, together 
reflect forward induction inferences from the 
observed value of q,. 

The winners then determine their efforts ac- 
cording to the x», and all m players observe 
the value of y, determined by (3) and update 
their beliefs again, to 


(9) Xirsi = ray E Sper yr +L — S41) Xie 


+ Grs t=0,..., 


where the a, and s, are constants and the G; are _ 
shocks. The level and slope parameters a, . | 
and «,,, represent any trends in beliefs and 
the precision of beliefs, as reflected by how 
much they respond to new observations of y,. 
The Xy+ ı then determine players’ bids next pe- 
riod via (7), and the process continues. 
Although (8) and (9) suggest partial ad- 
justment to the efforts suggested by the latest 
observation of q, or y,, they are best thought 
of as representing full adjustment to players’ 
current estimates of their optimal efforts, 
which respond less than fully to new obser- 
vations because they are only part of players’ 
information about the process. As explained 
in Crawford (1995 pp. 112-13), (9) gen- 
eralizes the familiar fictitious-play and best- 
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response learning rules to allow a much 
wider range of priors about the structure of 
the y, process.'’ Equations (8) and (9) are 
not fully ‘‘rational’’ in the game-theoretic 
sense, because players’ priors are not re- 
quired to be correct and linearity may be in- 
consistent with the rules that are optimal 


given their priors~However, in the control - 


literature such rules have been shown to pro- 
vide a robust estimation method for agents 
who understand the forecasting problems 
they face, but are unwilling to make the spe- 
cific assumptions about the structure of the 
process or unable to store and process the 
large amounts of information a Bayesian ap- 
proach requires. 

By construction, EG, = En, = E0, = 0 for all 
i and t, where E is the expectation operator. We 
assume that Gi, Nu, and 6, are independent of 
each other and serially independent for each i, 
which amounts to assuming that x; and x; fully 
capture the future effects of past idiosyncratic 
influences on player i’s beliefs. We also assume 
that Cir, Ni, and 9;, are i.i.d. across i, with com- 
mon variances denoted ¢?,, o7,, and o%. 

We assume throughout that 0 < 6,, €, = 
1. With no further restrictions on the behav- 
ioral parameters, the model is formally con- 
sistent with any history of the y, for any m, 
n,andf(-), with a, varying over time as nec- 
essary and G,, Yi 02, O71, and o$, set equal 
to zero, so that for all i and t, Ppa = Xp = 
Xin = qi = Ye = B5=0 Gs. Such solutions, in 
which players jump from one stage-game 
equilibrium to the next following some com- 
monly understood pattern, are not ruled out 
by standard arguments because. they corre- 
spond to equilibria of the repeated game. Yet 
they are empirically bizarre, because the ad 
hoc time variation in the behavioral param- 
eters they require violates the hypothesis 
that, in a given environment, the distribution 
of past behavior is indicative of current be- 


17 Unlike learning rules in which players respond only - 


to realized payoffs, such as those in Alvin Roth and Ido 
Erev (1995), (8) and (9) reflect the best-response structure. 
As explained in Crawford (1997 Sec. 6.3), VHBB’s sub- 
jects seemed to understand the structure, and such rules 
allow a better description of their behavior. 
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havior, which most of us take for grantes: 
learning to predict others’ behavior 
peated interactions. 7 
We address this issue below by impo, paz 
intertemporal restrictions that rule Out sick: 
hoc variations in the model’s behavior É 
rameters. These restrictions allow for the 
“sibility that a,, Br, Yr Cto Cr OF Oasys 
co, so that players learn to predict y, if it a. 
verges and choose the x; that are optimal aire 
their predictions, and the long-run stes} 
states of the dynamics coincide with the w 
strategy subgame-perfect equilibria oft 
stage game [shown in Section I, subsectiggg $ 
to satisfy Pa = Xn (= Xin) = qi = y, for all 
We find below that they permit an adeg; 
econometric description of individual 
jects’ behavior, which yields a useful 
nation of the dynamics of their interact} 


D. Discrete Efforts 


To estimate the model and to use it to p: 
dict experimental results, we must allow fg 
the discrete efforts of VHBB’s design. Wed & 
this by letting x, and X, continue to repress 64 
beliefs and remain continuously variable, 
with x; determining effort as the latent vari 5 
in an ordered probit, described in Section Ë 
subsection B. Beliefs, bids, and efforts still $ 
low (1), (3), and (6)—(9), but with eff 
and the order statistic determined by ro 
each x; to the nearest feasible integer and 
uating A(-) at the rounded values. 


H. Analysis 


We begin by analyzing the model with 
tinuously variable efforts. In Section I 
section F, we explain how to extend: 
analysis to the discrete efforts of 
experiment. l 


A. Preliminaries 


Once the distributions of Gir, firs ands 
specified, (1); (3), and (6)- (9) de 
Markov process with state vector 
which beliefs, bids, and efforts af 
across players ex ante. Equation U { 
vides initial conditions and substituti®$ 




































d(7)- — (8) into (9) and using (2) and 
(5) gives ¥,+1 as a function of the <;,, 
; , and shocks: 


pp Orti + pôt Y: 


+ ô 8 (Xi. + Mirs + 3A + Nme) 


+ E+ AL (1 r 6) Xir + Oin Has 


l — 5, ) Xe + ints Xir = ita vee 3 Aint + Tin | 


+ (1 = G4 LC i) + Ge] 


+ Cres t=0,.... 


‘codel’s recursive structure and the inde- 
e of players’ deviations from the av- 
adjustment rules capture the requirement 
: players form their beliefs and choose their 
aratges independently that is the essence of 
eordination problem. 

The model’s dynamics are driven by stra- 
uncertainty, as represented by the o?,, 
da. Different assumptions about how 
E zn and ca vary over time have different 
elleations for the dynamics, which go a long 
Aoward identifying the stochastic struc- 
sz. The following result is helpful in under- 
“ending this relationship. 


ROPOSITION 0: Suppose that a, = 0 for 
ti; a and p, = y, = 0 for allt =0,. 
here are no exogenous trends in be- 


) that the rules that describe players’ 


onward for some T = 0. Then there 
‘players, j and k, such that 8l 
Git and hl Eiis ina Mees incs An) = De 
+3 and Xa and x, converge with 
ity ane to a common value, weakly be- 
re ait and Xer, that is completely deter- 

Lby Zr, Xr, and the ô, and e, for t = T, 

T= > 0, so that players’ initial beliefs 
as well” then Xin = Xu = Ao for all i 


f is in the Appendix. Proposition 


substituti that if there are no differences in 
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players’ responses to new information from 
period T on, the dynamics must converge to 
an equilibrium dictated by, and bounded be- 
tween, the Xy and x,; that determine g(-) and 


h(-) in period T. In VHBB’s environment x); 


and xX,7 are the tenth and fifth largest Xir (be- 
cause n = 9, g(-) is the (n + 1)th highest 


bid, and A(-) is the median of thé nine lafgest *~ ~*~ 


Xir). Thus, unless differences in responses 
persist long enough to drive Xr and X,7 to 
efficient levels, the model cannot explain the 
convergence to the efficient equilibrium 
VHBB observed." 

If o2, o%, and a2, converge to positive con- 
stants, instead of remaining at 0 for all t = T, 

, an analysis of long-run equilibria seems pos- 
sible, along the lines of the analysis of evolu- 
tionary dynamics in Kim (1996). This would 
probably reproduce Kim’s conclusion that there 
is efficient coordination in the long run, as in 
VHBB’s experiment. However, the closely re- 
lated experiments of Weber (1994) and Cachon 
and Camerer (1996) suggest that efficiency is 
not inevitable for all values of m, n, and f(-) 
(although no amount of data from experiments 
with finite horizons can ever disprove a long-run 


result). More importantly, we find below that: 


when o2, 7%, and o¢, decline gradually to 0, as 
one would expect as players learn to forecast q, 
and y, from their common observations, the 
model yields history-dependent dynamics that 
closely resemble VHBB’s results. And in this 
case the model also provides useful information 
about the mechanism behind the efficiency- 
enhancing effect of auctions and the effects of 
changes in the environment—information that 
would very likely be lost in an analysis of long- 
run equilibria. , 


B. Closed-Form Solution 


We now show how to analyze the dynamics, 
whether or not the variances converge to zero, 
in a way that yields a deeper understanding of 


a 


'§ This not the whole story because in our empirical 
analysis a,, 6, and y, sometimes differ from 0 and, as 
explained below, these parameters also affect efficiency. 
We set them equal to 0 here, except for a, to focus on a 
more subtle aspect of the dynamics. 


-r ee 
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-how outcomes are determined and- makes it 
` possible to estimate the effects of changes in 
< the environment. 

The key to much of our analysis is the fact 
that the scaling properties of the order statistics 
g(-) and h(-) and the form of players’ learning 
rules allow us, despite the model’s nonlinear- 
ity, to obtain a closed-form solution for the 
entire. history. of players’ beliefs, bids, and ef- 
forts as a function of the behavioral parameters 
and the shocks that represent strategic uncer- 
tainty. In what follows, sums with no terms 
(like 242 £, ih, for t = 0) are undérstood to 
equal 0, and products with no terms are un- 
derstood to equal 1. 


PROPOSITION 1: The unique solution of the 
learning dynamics is given, for all i and t, by 


(12) Xi = 
s=0 s=0 
t-—} 
+ 5 Es + 1h, F Zit 
s=0 
and 
t Es, 
(13) =) [a + pð + 71+ È 6:8 
s=0 s=0 
t=i 
+ 5 Es + 1Ms + h,, 
s=0 
where 
(14) &: = Elz + They sey Lint a: Nml» 
h, = h(zir, e.. Zm; Zir T Nits ... 9 Zit + Nme) 
(15) w= Y [C1 — 6,)Gis + bi] 
s=0 
i at (1 -— 6,-j4)0 - erin) 
j=l 
and Z, = (1 -— Er)Zi-1 + Cir- 


$ [a, + bós + ys] + È ôg. 


The proof is in the Appendix. Pro 
expresses the outcome of the learning 
the cumulative effect of trend terms a,, 
y, and shock terms 6,g, and Ers 1h,; the eff. 
which persist indefinitely.'? This pers 
makes the process resemble a random w 
the aggregate, but with nonzero drift and ġ 
ing variances determined by the parameters 
the treatment -variables. As explained in: 
I, subsection C, œ and the a, represent the 
erage level of players’ initial beliefs ang 3: 
trends in their beliefs, the 8, correct their bj 
rules for the expected cost of strategic 
tainty, and the y, reflect their average fory 
induction inferences on observing q,. The 
reflect the effect of dispersion in players’ 
on their beliefs via the market-clearing prj Ste 
The ¢,4,h, reflect the robustness effect of & Ss 
dispersion of the winners’ efforts via y, dis 
acterized in Broseta (1993a Proposition: 
and Crawford (1995 Proposition 1), modif $ 
here by the optimistic subjects effect. Props 
tion 1 is valid no matter how the shocks are ge $ 
erated, but much of its usefulness stems frome i 
fact that the shock terms are known functions 




























ag 


as 


i for any t. This and the scaling properti 
g(-) and A(-) are crucial to the comparatved 
namics analysis below. 


C. Convergence 


In the next proposition, for technical reas % 
we bound players’ efforts. This is done by 
creasing xy to its lower bound, denoted x, &F $ 
ducing it to its upper bound, denoted x, whet 
would otherwise fall outside the interval [44 


PROPOSITION 2: Assume that the Xi 
bounded, so that they remain in the inl 
(x, x]. FÈS- 0 Qs, 25 =0 bs, 2 s=0 Ys: 
Ts, Ès=0 CZs, ANd Vou Gz, are finite 
0 < ô, ¢, = 1 forall t, with (1 — oU 


F E r 
'? The remaining terms, z; in (12) and h 18 Cyan Xt.. 
subsumed in the shock terms after the period in W È 
first appear; and the remaining endogenous variate 
Pin and qn are easily computed from x; an dete 
(1), (7), and (9). 



















































the X, and Xu converge, with prob- 
to a common, finite limit. 


‘ons under which the learning dy- 
‘giao to the bids and efforts asso- 

th one of the symmetric pure-strategy 
ect equilibria of the stage game. 
‘4el’s implications for equilibrium selec- 
cite best summarized by the prior proba- 

y distribution of that equilibrium, which by 
‘ion 1 is typically nondegenerate due to 
stent effects of strategic uncertainty. 
Proposition 2 is quite helpful in un- 
the model, its conditions are for 


bient; convergence can easily occur when 
‘<yiolated. The variance conditions are 
ie would expect from the strong law of 
a numbers, and probably cannot be weak- 
‘significantly if one insists on convergence 
‘probability one. However, the estimated 
gadel reported in Section MI, subsection E, 
iverges strongly to the upper bound x in 
BB's environment, even though our param- 
€ catar estimates violate Proposition 2’s conditions 
Raj the p, and y,. This suggests that our con- 
--dalons on the a, 8,, and y, could be weakened, 
Bat the additional insight this would yield 
sims unlikely to justify the effort. 


; via ydy 

position ym 
Riley... 

1), at 


‘Qualitative Comparative Dynamics 
Š 4 


ition 1 shows that unless the behav- 
eters vary sharply with changes in 


ioted X,“ t-e treatment variables, the dynamics and lim- 

interval [4E “388 Outcome respond to such changes in sta- 
% dictable ways. We now conduct a 

that th Fali ve comparative dynamics analysis of 

in the in m, n, and f(-), holding the behav- 

E o I ameters constant. 

are finit fine the jth order statistic of an em- 
( eeu (€),...,€,) as the jth small- 


&:; thus, the minimum is the first 
Stic and the median (for odd n) is 
/2ZYth. If f(-) is an order statistic, 
identifies which one; and if f(-) is 


mane Bi aex Combination of order statistics, jin- 
pena o € order statistics from which it is com- 


Increase in j’’ refers to any shift 
ghts used to compute f(-) that in- 


| f is in the Appendix. Proposition 2- 
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creases j in the sense of first-order stochastic 
dominance, and a random variable is said to 
“‘stochastically increase’ if its probability dis- 
tribution shifts upward in this sense. 


_ PROPOSITION, 3: Holding m and n and 


the behavioral parameters. constant, in- 
creasing j stochastically increases y, and Xy 
for alliandt. 


PROOF: 

The proof is immediate from Proposition 1, 
noting that for any history of shocks, increas- 
ing j increases h, for all t and leaves the g, 
unaffected. 


PROPOSITION 4: Holding m and j (or the 
weights on alternative values of j) and the 
behavioral parameters constant, increasing 
n stochastically decreases y, and x; for all.i 
and t. 


PROOF: 

The proof is immediate from Proposition 1, 
noting that for any history of shocks, increas- 
ing n (with i running from 1 to the larger value 
of n) decreases both g, and h, for all t. 


PROPOSITION 5: Holding n and j (or the 
weights on alternative values of j) and the be- 
havioral parameters constant, increasing m sto- 
chastically increases y, and x; for all i and t. 


PROOF: 

The proof follows from Proposition 1. It is 
clear that for any history of shocks, increasing m 
increases g,. It is not true that increasing m in- 
creases h, for any history, because such an in- 
crease may cause turnover among those who win ` 
the right to play. However, it follows from (8) 
and the fact that the 6, are iid. with mean zero 
that any turnover replaces players with players 
whose effort distributions first order stochastically 
dominate them (whether or not the x, are 
bounded ), which, stochastically increases h.. 


Propositions 3 and 4 extend Broseta (1993a 


` Lemma 3.1 ) and Crawford (1995 Propositions 


3—4), making precise, in the probabilistic 
sense appropriate to the model, the intuitions 
that coordination is less efficient when more 
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efficient outcomes are robust only to devia- 
tions by smaller sets of players, and less effi- 
cient in larger groups because it requires 
coherence among a larger number of indepen- 
dent decisions. Proposition 5 addresses a new 
issue, showing that increased competition for 


the right to play favors efficiency because it 
tends to yield-higher 


and intensifies the optimistic subjects effect. 


E. Quantitative Comparative Dynamics 


We now conduct a quantitative comparative 
dynamics analysis, showing how the mean co- 
ordination outcome is affected by changes in 
the treatment variables and the behavioral pa- 
rameters. The analysis requires some addi- 
tional notation. Let o?, and o? 


z» respectively, 
denote the common ex ante variances of the 


Pi (Or, equivalently, the Zi + Nx) and the zy. 
Because the ¢,, and 6, are seri ally independent, 
(15) and (7) imply that 


t 


È [0 ~ b,a}, + o3] 


sz 


A ts 2 
x [1 (1l =ó, na) -e| 


(16) ch = 


j=) 
and 
(17) of =(1- PROS + ot, +o. 


Finally, let 4, = Eg((Z, + ,,)lo 


pty sey (Zm + 
me)! Ops) and V, = Eh( 2/04, wary Zu! Cus (Zit 


F ie) / Ops a (Zin + Tim )/0,,). Because the 
random variables (Z3, + ,,)/ Op and Z;,/o, are 
standardized, with mean 0 and variance 1, 4, 


and v, are purely statistical parameters, com- 


pletely determined by the joint distribution of 


these random variables and the treatment vari- 
ables m,n, and f(-); they are subscripted be- 
cause that distribution is time-dependent, and 
their dependence on m, n, and f(-) is sup- 
pressed for clarity. u, and v, respond to 
changes in m, n, and f(-) in the expected di- 
rections, as can be inferred from the responses 


of g, and h, identified in the proofs of Propo- 
sitions 3~5. 


“market-clearing’ prices” ` ` 















PROPOSITION 6: The ex ante means’ 
and the x, are given, for all i and t, by .: 


(18) Exi = > [æ + B.6, + Ys] 
s=0 g 
t t-] 
+ D, Opss + > Es 410 
s=0 s=0 > 
and 
(19) Ey, od YX [æ + pôs T Ys] 
s=0 
! t-l st 
+ x 6, pshs + 5 Es+10zVs + 
s=0 s=0 
PROOF: 


Using (2), (4), (5), and (14) yields 


(20) Eg, = 


El ons8 ((Zis F Mis)! Ops» e (Zms pi Tins)! 0p 


O pss 


and 


Il 


(21) Eh, 
Elo,Eh( 21/02, athe: Zus! Ozs3 
(Zis F Ms)! Ops» ees 3 (Zms + Tins)! Gn) + 


= OzVs. 


Taking expectations in (12) and (13) 
yields (18) and (19). 


Proposition 6 expresses the mean cori 
nation outcome as a simple function F fi 
havioral parameters; the o,, and oz, Wi 
be computed from the behavioral pe a 
using (16) and (17); and the statistic : 
eters u, and v,, which reflect the influe?” 
m,n, and f(-) on the dynamics. To apps." 0:75 

‘result we must estimate the behavioral Eabh, 
eters and evaluate u, and v,. The esti i ; 
carried out in Section III. 4, and v, are 
to evaluate because their dependenc® 
distribution of the Zp + Na and Zy 18 ©: 


ible to approximate them, as we 



































roximations are based on further as- 
Sans. The first addresses a difficulty in 
g Eh, that arises because unless 07, = 
0, so that there is no dispersion in play- 
dding rules and their inferences on ob- 
the highest bidders need not be 
Ag s would choose the highest efforts in 

| E. Aos dination game. We assume that 


Zir + Nits aat. Zm + Tmt) 


rs Eh(2u e.. y Sets Zits wey Sar); 


gansed by such ‘“crossovers’’ approximately 


cancel out on average. This should not be an 
È- Eapartant source of bias, because 





zu = (1 6% + Oa 
= (1 B 6r) (Zit + Nir) 


= (1 E 6:) Nis + Oir, 


š that there are no systematic differences be- 
i scen the orderings of the z, and Z, + na. 
“Ths approximation yields a simple charac- 
ation of the optimistic subjects effect, in 
the order statistic is adjusted to reflect 
Metion’s tendency to select players who 
higher efforts. In VHBB’s environ- 
ere the right to play a nine-person 
e was auctioned in a group of 18, 
approximated by the robustness effect 
aG-person game without auctions in 
order statistic is the fifth highest ef- 
edian of the nine highest efforts). 
the purpose of evaluating Eh, the 
effectively transformed VHBB’s me- 
» Which without auctions would 


“To api : fe dynamics, into a 75th-percentile 
; 15 = 13.5/18) with a robustness effect 
dutes.a large upward drift. 

the magnitude of this drift we 
Broseta (1993a) and Crawford 


Ze + ni: and z, are approximately 


Tobustness effect, that,contributes zero . 
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jointly normal for all i and t. Normality is a 

reasonable approximation because Z;, + ny and 

Zą are weighted sums of serially independent 

random variables. It makes the distributions of 

(Zi; + ni)/o» and Zilo y independent of t as well 
t t 


as i, so that u, = p and v, = v. Given that 
(Zi, + Nu) p and zida y are independent across 
i, u and v can be estimated [using (22) for v] 
from D. Teichroew (1956 Table I) for any or- 
der statistic and m = 20. For VHBB’s envi- 
ronment this yields pw = —0.069 and v = 
0.665, the respective means of the tenth and 
fifth largest [the (n + 1)st and the median of 
the n largest] of 18 i.i.d. standard normal 
variables. 

Continuing to ignore the discreteness of ef- 
fort, Proposition 6 can now be used to analyze 
the sources of the efficiency-enhancing effect 
of auctions in VHBB’s experiment. Assume, 
as in Section IMs econometric analysis, that 
& = efort=1,..., and 6, = ô fort =0,..., 
and set a = @ and a, = 0 fort = 1, ....”° 
Proposition 6 then implies 


(24) EX = a + 5 [5B, F Ys] 


s=0 


t-1 


t 
+6u $, op tev }, ox 
s=0 


s=0 


and 


(25) Ey,=a@+ > [66,4 y] + ôu $, ops 
s=0 =0 


s= 


t-1 
tev ¥ Or + Vou 
s=Q 


which express Ex, and Ey, as functions of p, 
v, and identified behavioral parameters. @ is 
the average initial level of players’ beliefs; 66, 


on 


20 Setting a, = 0 involves no loss of generality because, 
as (26)—(27) make clear, of the four parameters in the 
constant terms of (18) and (19), a, + 6,6, + Yn» only 6, and 
the combinations a, + £, and (1 — 6,a, + y, are identified. 
Setting a, = 0 replaces those combinations by £, and y, 
which now partly reflect the trends in beliefs in a, 


+ 
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TABLE 1—APPROXIMATIONS OF THE DYNAMICS FOR VHBB’s EXPERIMENT 


t a 5B, . Yı buoy: EVO u) VO y Exa Ex, 
0 4,940 —0.050 0.297 —0.036 — 0.811 5.151 5.542 
1 — —0.060 0.358 —0.025 0.622 0.554 6.046 6.600 6.166 
2 — 0.068 0400 -0.020 0427, 0.444 6.785. 7.000 6.610 
3 — —0.073 0.432 —0.017 0.341 0.379 7.000 7.000 6.721 
4 — l —0.077 0.458 —0.015 0.291 0.335 7.000 7.000 6.902 


reflects the adjustment of players’ bids for stra- 
tegic uncertainty and any trends in beliefs; and 
Ys combines the forward induction effect and 
trends in beliefs. duo, is the effect of the dis- 
persion of bids via g,. evo, is the persistent 
component of the optimistic subjects/robust- 
ness effect, which incorporates the inertia of 
players’ adjustments through e. Finally, voy 
in (25) is the direct effect of the dispersion of 
efforts on Ey,. Part of this last effect is tran- 
sient; after the period in which it first appears 
it is multiplied by £ and incorporated into the 
sum in the preceding term. 

Table 1 presents approximations of Ex;, and 
Ey, based on (24) and (25) and Section DI’ s 
estimates of the behavioral parameters, re- 
ported in Table 2. They are computed under 
the parameter restrictions described above and 
simple intertemporal restrictions on 6, y, and 
the shock distributions imposed in estimation, 
described in Section HI, subsection E. The sec- 
ond through sixth columns in Table 1 give the 
terms in (24) and (25) for periods 0—4, with 
the implied values of Ex, and Ey, and the av- 
erage values of x, and y, in the eight trials of 
VHBB’s experiment, Ex, and Ey, for com- 
parison. Ex; is computed by summing the sec- 
ond through fifth columns in row t and adding 
the result to Ex;,_ ;, truncating at 7.000 as nec- 
essary. (For Exio, Exu- 1 is replaced by the es- 
timated value of a, 4.940.) Ey, is computed by - 
adding the transient term vo, to Ex;, and trun- 
cating at 7.000: 

Because some parameters are not precisely 
estimated, we view Table 1’s results as illus- 
trative. The approximations track thé dynam- 
ics reasonably well, somewhat overestimating 
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' bustness effect of the minimum m 






















the rate at which Ex; and Ey, > 7. They suja E 
that of the four components of Ex, and Eya $ 
our theoretical analysis, only two were ina: 
tant in VHBB’s experiment: Y, forward 

duction, and svo,,_,, the optimistic subi $ 
robustness effect. Both favor efficiency, | 
roughly equal strength. In the three perii $ 
(0—2) it takes Ex, and Ey, to reach 6.785 ü fim 


crease in Ey,” 

Our methods can also be used to asses 
likely effects of changes in the treatment’ 
ables, on the assumption that the behav 
parameters do not vary sharply with’ 
changes. Consider replacing VHBB’s 
game with a nine-person minimum gami 
all else unchanged. u = —0.069 as bef 
cause m and n are unchanged, but v 
approximately the robustness effect in 
player game without auctions, in whic 
offs and best responses are determin 
ninth largest (the minimum of the nine, 
of players’ efforts. For this game Tei 
Table I yields v = 0.069, as the optim! 
jects effect is almost neutralized by 


tenfold fall in v reduces evo, ane! 
Table 1 proportionately, yielding app™ 


TEA 


2! These contributions sum to more than 7 028 a 
cause the other two components contribute “Tw 
the approximated Ey, must be rounded down 
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TABLE 2—PARAMETER ESTIMATES 
Xi Y Ar 6 g Onn or hy p 
542 x 90,0949 0.2973 — 0.5287 — 1.4872 1.0000 — 0.5226 
fe (0.154) (0.221) (0.137) (0.383) = (0.217) 
166 ane anan t e eNe carga 
-0.1144 0.3583  -0.2689 0.5287 0.9348 0.6938 0.4665 1.0999 0.5226 
610 (0.177) (0.181) (0.402) (0.137) (0.309) (0.132) (0.072) (0.223) (0.217) 
721 -0.1276 0.3995 —0.2689 0.5287 0.9348 ` .0.4442 0.2987 1.0999 0.5226 
ma (0.195) (0.156) (0.402) (0.137) (0.309) (0.090) (0.073) (0.223) (0.217) 
—0.1378 0.4317 0.2689 0.5287 0.9348 -0.3237 0.2177 1.0999 0.5226 
(0.211) (0.144) (0.402) (0.137) (0:309) (0.075) (0.067) (0.223) (0.217) 
-0.1464 0.4583 -0.2689 0.5287 0.9348 0.2533 0.1703 1.0999 0.5226 
They si (0.224) (0.144) (0.402) (0.137) (0.309) (0.067) (0.061) (0.223) (0.217) 
ae -0.1537 0.4814 —0.2689 0.5287 0.9348: 0.2072 0.1393 1.0999 0.5226 
) were (0.236) (0.152) (0.402) (0.137) (0.309) (0.061) (0.056) (0.223) (0.217) 





?, forwardi 1 
istic subjegeine 
ficiency;wEs 

three pebi 































<== 


salues for Ey, of 5.235, 5.629, 6.031, 6.448, 
‘gad 6,879 in periods 0—4. . 
These values are consistently higher than 
observed in Weber’s (1994) single trial 
this environment, which yielded y, of 3, 4, 
‘and 3 in periods 0—4. We suspect the 


vy a= 

ncreases; 1u 
. TVA 

transienti 


d to asses epancy arises because the minimum is too 
treatment :i-irom the median to justify ignoring 
the be anges in the behavioral parameters, and be- 
ply wi k Eades the approximations ignore the bounds on 
HBB’s: a Ye To predict outcomes accurately by 
jum g zinéthods in this new environment would 


émpirical information about behavior 
tby environments. . 


. Analysis and Simulations 
with Discrete Efforts 


ne Tei st Of our analysis extends to the model. 
> optimises discrete efforts £, outlined in Section I, 
ized bystem on D. The dynamics are still Markov, 
um gamt Slate-vector X, representing players’ be- 


Only difference is in how beliefs de- 
efforts andy, Adapting the arguments 
a (1995 Sec. 5) shows that Propo- 
Land 2—5 hold as stated, and Propo- 


d 6 hold approximatel 
y when the 
feasible efforts is fine enough. 


e than T í oye R i 
atribute ` or probability distributions of the dy- 
led down ting outcome can be estimated 


z Asymptotic standard errors in parentheses; log L = —834.955. 


precisely for the model with discrete efforts ` 
only by repeated simulation. As a check on the 
approximations we computed frequency dis- 
tributions for 5,000 simulation runs with the 
estimated parameters reported in Table 2, un- 
der normality assumptions described in Sec- 
tion III, subsection A. The simulation results 
are reported in Tables 3 and 4 and Figure 1. 
Comparing the simulations with the approxi- 
mations in Section II, subsection E, suggests 
that the approximations overstate the rate of 
increase of. £;, and y, because they ignore their 
upper bounds, which cut off the upper tails of 
the normal distributions on which our esti- 
mates of u, and v, are based. Although this 
problem will probably arise whenever either 
an upper or a lower bound comes dispropor- 
tionately into play, the analysis of Crawford 
(1995) suggests that our approximation tech- 
nique will otherwise yield reasonably accurate 
results.” 


em we wo e. > wir 


22 Crawford and Broseta (1995 Figures I-IV) report 
simulation estimates of Eq, and Ey, for alternative envi- 
ronments in which f{-) (‘f° in Section H, subsection D) 
varies from the fifth to the third or seventh order statistic, 
n varies from 9 to 7 or 11, and m varies from 18 to 16 or 
20, holding other treatment variables constant at VHBB’s 
values. These results suggest that, at least for moderate 
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TABLE 3—-MEAN VALUES OF q, AND y, (t = 0, ... , 9) 








Actual Predicted 
t mean q,, y, mean 4,, yı 
0 4.775, 5.750 4.782, 5.648 
be O $.650,6.375 5.454, 6.207 
2 6.213, 6.625 6.465, 6.864 
3 6.450, 6.875 6.768, 6.963 
4 6.700, 6.875 6.921, 6.993 
5 6.775, 7.000 6.979, 6.998 
6 6.850, 7.000 6.996, 6.999 
7 6.838, 7.000 6.999, 7.000 
8 6.913, 7.000 7.000, 7.000 
9 6.988, 7.000 7.000, 7.000 


HI. Econometric Specification and Estimation 


This section provides a general econometric 
framework for estimating the behavioral pa- 
rameters and reports estimates and tests for 
VHBB’s (1993) experimental data. 


A. The Model 


We continue to assume that players’ bids, 
Pin the market-clearing price, g,, and their be- 
liefs, x; are continuously variable, but we now 
assume that players’ efforts are determined by 
an ordered probit model of discrete choice, de- 
scribed in Section HI, subsection B. 

Substituting X; from (9) into (7)—(8) yields 


(26) Pi 7 B, + Q, + Eyi 


+ (1 E)Xir- 1 pn Cit + Tit 


and 


changes, all three of these treatment variables have sig- 
nificant effects, close to those suggested by our 
approximations. 
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(27) xe = 7, + (1 — 6)a, + ôq, 
+ (1 — 8EY: 
+ a — ôN — ENX, 
+ (Í — Eba + 6, 
for t = 1, ..., andi = 1,..., m, with} 


conditions given by setting presample viel 
to 0. For simplicity we write Ti = G ty 
and v, = (1 — 6)Gin + 6; We maintain & a 
following distributional assumptions. Cg $4 
tional on I, the history of p; and x; thri 
time ¢ (up to and including y,), the vec 
innovations (Tin Vi) is jointly i.i.d. across 
serially uncorrelated, and has a bivariate 
mal distribution with means 0, variances 
og + aż, and o?, = (1 — ô) oh + On 
covariance (1 — 6,0?,. 
The model defined by (26)—(27) determisst 
all m players’ current bids and beliefs, as fix 
tions of the observed values of y,— and q 3 
players’ past beliefs x; , (but not their fi fee 
efforts). For any given t, player i plays thee 
ordination game if and only if (barring tf 
which have zero probability) pi, = % $ 
yields two regimes for x: one in which pf j 
q: and Xj, is completely censored; and ott 
which p; = q, and xy is not directly obser 
but acts as a latent variable determining 
i’s observed effort. Thus, sample separalie? 
determined endogenously by whether œ # 
Pi < q” : 
The resulting model raises three econ, 
ric issues: discrete choice, endogenous ss! 
selection, and latent explanatory variables ® 
discuss these in Section III, subsections B 
constructing the likelihood function as 
We then discuss specification and est 
issues and report estimates in Section 
section E. 


2 We assume that the p, are unobserved, sal Nae A 
the computations tractable we treat q, as exoge oe 
are reasonable approximations in VHBB’s tfe 


gs NO. 

































B. Discrete Choice 


: Because players’ efforts are naturally or- 
i. dered by their payoff implications, we use an 

dered probit discrete-choice model, as in 
Daniel McFadden (1984), in which the x, de- 
temine players’ bids and efforts. We partition 
Èa alate space of x;,—the extended real line — 
Sato fesponse regions G (k = 1, ... , K), where 
the interval (ay 15 a], and a = =% < 
< ag = +, We then Set £, = k if and 
if x, € C. Under our normality 





= Pila, — E- xon] 
and es . 
P section > Dila -1 — E,_ Xu) als 
k: =1,.., K, 


ief *) and (-) denote the marginal-den 
Brisa and the standard normal cirnuletive 
n function, respectively. 

bmmonly assumed that the thresholds 
dependent of i and t. We make the 
assumption that Qa=kt+ VA for k = 
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TABLE 4—DYNAMICS OF y, (t = 0, ..., 6) 


Yo yı X2 y3 ya Ys Ye 
puana 
7 0.125 0.625 0.625 0.875 0.875 1.000 1.000 
0.055 0.329 0.654 0.871 0.964 0.994 0.999 
6 0.500 0.125 0.375 0.125 0.125 0.000 0.000 
` 70.550 0.551 0.315" 0.122 0.036 0.006 0.001 
5 0.375 0.250 0.000 0.000 0.000 0.000 0.000 
0.382 0.117 0.031 0.007 0.000 0.000 0.000 
4 0.000 0.000 0.000 0.000 0.000 0.000 0.000 
0.013 0.002 0.000 0.000 0.000 0.000 0.000 
s3 0.000 0.000 0.000 0.000 0.000 0.000 0.000 
0.000 0.000 0.000 0.000 0.000 0.000 0.000 





Note: Actual frequency distributions in first lines, predicted distributions in second lines. 


1, ..., 6, so that ĉ, is determined simply by 
rounding x; to the nearest integer in the set {1, 
IO bs as 


C. Endogenous Sampling 


Because the subsample of n players whose 
efforts are observed each period is endoge- 
nously determined, and 7; and vy are generally 
correlated for any given player, standard 
maximum_-likelihood estimates of the param- 
eters in (26)—(27) will suffer from sample 
selection bias unless corrective action is 
taken.” 

Let T, denote the set {i| py = a of players 
who (barring ties) play the coordination game 
at time t, which VHBB recorded along with 
the other data from their experiment. We 


24 There is no loss of generality in identifying the scales 
on which £, and x; are measured. The equal spacing of 
thresholds is restrictive, but it is optimal in VHBB’s me- 
dian game when players are risk neutral and x; is player 
i’s estimate of Efy,|1,_,] (Crawford 1995 footnote 27 p. 
127), and fixing the a, greatly reduces the number of pa- 
rameters to be*estimated and identifies the conditional 
variance of vy 

25 Various corrective methods have been proposed in 
the literature. See for example G. S. Maddala (1983 Ch. 
9) for limited dependent variable models and Charles 
Manski and McFadden (1981) or Takeshi Amemiya (1985 
Ch. 9) for qualitative response models. 
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6.5 


Order Statistics and Prices 
5.5 


45 


FIGURE 1. SIMULATED DYNAMICS 


propose a full-sample estimation procedure, in 
‘which (26)—(27) are estimated simultaneously 
using all available information on subjects’ de- 
cisions. For each k = 1, ..., 7, define the 
dummy variable Di, = 1 when £, = k and 0 
otherwise. Let w, denote the vector of param- 
eters of interest at time ¢. Conditional on I, _,, 
the contribution of the (it)th observation to the 
likelihood function can then be written 


qı 
(29) Llo) = lie T} f JoPi) Bir 


7 flee Di, 
+ lier, > I. f Kein Pu) Tit ap, | > 
. k” qı 


k=l 


where 14 is the indicator function of the set 
A, and K-) and f,(-) denote the joint condi- 
tional density of (x;,, p;,) and the marginal den- 
sity of Pi, respectively. The two regimes in 
(29) reflect observations in which subject i 
does not play and we learn only that pi, < qn 
and those in which subject i plays and we learn 
both that p, = q, and that x, € C, when 4, = 
k. In this respect our model resembles differ- 


Period 






























Predicted mean at x 
© — —- Predicted mean a + 1 S.D. 


B — —.- Predicted mean qe ~ 1 S.D. 
f——— Predicted mean yr A 
& ~ —- Predicted mean ye + 15.0. 
B=- —- Predicted mean yg ~ 1 $.0. 
tw — Actual mean ae 


Actua! mean yr 


ential response models for analyzing cf 
section data from surveys with volunliy| 
participation (Stephen Pudney; 1989 Sec? 
or James Heckman’s (1974) model of lä 
supply, with the added complications of & 
dynamic structure and the double inte 
(29). 

This estimation technique makes ff 
demands on the sample. Both the sample® 
lection equation, (26), and the equation ® 
scribing the adjustment of beliefs, (27), =: 
be estimated from observations of Lin J 
q; and qna for which we have only el 
servations per period, plays a crucial 
identifying the parameters. Muthé 
Joreskog’s results, reported in Maddala 
p. 267), suggest that we may not obtait 
precise estimates of the parameters /; t, 
and o2, in (26). 


D. Latent Explanatory Variab 


Standard estimation techniques: $ 
two-stage probit and Tobit models © 
Grether and Maddala [1982] and 
[1983 Ch. 9]), deal with the latent: 















































33 NO. 1 


he right-hand sides of (26) and (27) 
ating the reduced form of the model. 
‘however, the error terms of the reduced 
are serially correlated, so that full 
i tim-likelihood estimation would in- 
foe ece-calculating the T-fold integral of the 
y distribution of x;, and p; (È = 1, ..., m; 
=, T). This is computationally intractable 
nsen for simple types of serial correlation, and 
f= « piodel already involves additional diffi- 
-ties due to endogenous sampling and a non- 
jonary error structure. 
a address these problems by using the re- 
t aeced forms of (26)—(27) to maximize the 
: god function L(w) = II, Lij(e@,), with 
(Wp, ..., @r) and L;{-) as in (29), with re- 
pct to the parameters of interest. Thus we 
{rare the serial correlation, which will result 
È za inefficient parameter estimates, while taking 
2 heteroskedasticity of the error terms fully 
3g account as described below. We are not 
swaro of any consistency or asymptotic nor- 
gality: results for maximum-likelihood esti- 
gators in models like ours. Thus our estimates 
a the behavioral parameters and standard er- 
k tea Hist be interpreted with caution. 


: 88 explained in Section I, subsection C, 
ere isa danger of overfitting unless we rule 
d hoc time variation in the behavioral pa- 
We therefore impose simplifying in- 
poral restrictions on. the parameters, 
OW an adequate econometric descrip- 
‘behavior. As in Section II, subsection 
te, =e fort = 1, ... and 6, = 6 for 
“We also set a = & and a, = 0 for 
5 $0 that a, + 6, = B, and (1 — ôa, + 
Stort = 1, ... ; this involves no loss of 
ue only a, + 8, and (1 — 5a, + 

ütified. We further assume that 6, = 
ah and Ye = Yl + tyr fort = 1, 
Y, following Broseta (1993b) and 
1995), we impose the following 
poral constraints on the variance- 
~~ Matrix of the innovations Ty and vi 
21): 02, = o2(1 + 1), o2, = 02, 
and Cov(r,, Vi) = Cov(Tio, Vio) 
a the correlation coefficient 










O tom 
hniquesss of, 
it modelSayg 





Ine late 
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Under these assumptions, backward substi- 
tution for x;,— ; in (26)—(27) yields 
(30) pir = Bo(1 +t)" + yo(1 -= e) 


x 5 K= E -e)]* a -k+-1)™ 


k=} 


+ (1 — e)'(1 — 6)'"'[yo + (1 - ô)a] 


t 


+e 3}, [(1 9) Se) ik 


k=] 


+ 6(1 - e€) 


k= 1 


x Sa — (1 — €)] qr- + Ti 


+(1-e) 5 EASE e 


k=! 


and 


t-i 


(31) xe =o DY KI- 6)(1 ~ e)]* 


k=0 
x(t-k +1) 

+ {C1 — e)(1 — 6)]'fyo + (1 — ô)a] 

+ e(l — ô) 


x E [O — 61 — 2)}* y- 


k=} 


+6 $ [C1 - 6)(1 - €))"q-3 


k=0 


+ E [C1 — 6)(1 ~e)] Vi- 
k=0. 


-o -w a 


Letting er = Ty + (1 — €) Zr- (A — ô) 
(1 — E) 'vr-r and tr = Zao [A — 6) 
(1 — e) 'v;,_, represent the error terms in (30) 
and (31), it is clear that the ep and uy are 
serially correlated for any given i, but 
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uncorrelated across i. The relevant components 
of the variance-covariance matrix are now given, 
for t = 0, ..., by 


(32) Var(uin) = o% 5 [(1 — 6)(1 — e)” 
ee Oe ; 20. 


X (t-k+1)™, 


(33) Varea) = 07,11 +)” + (1 — €)*o?, 


x 5 [(1 — (1 - e)Pe-? 
k=1 


X(t-k+ 1), 


and 
(34) Cov(ej, us) = Cov(rio, Vi) + A 


+ (1 — e)z 


x y [COG e) i=" 


k=l 


X(t-—k+1)™. 


For each period there are eight observations 
of q, and y, and, counting both regimes in (29), 
144 of £,. The model defined by (28)-—(34) 
was estimated by maximum likelihood, using 
the data from the first six periods of VHBB’s 
eight trials.” There are ten parameters of in- 
terest: Æ, Bo, Yo, Ar, Ô, €, CZy Oze Av, and p.” 
All ten are theoretically identified. However, 


6 Although the trials lasted 10-15 periods, conver- 
gence was so rapid that there was little sample variation 
after period 6 and none after period 7. We excluded the 
data in every period for subject 1 in trial 10, who paid a 
high price to play the coordination game in period 3 and 
then played the lowest effort. Preliminary estimates re- 
vealed that this subject’s behavior had a large impact on 
the estimated o}, The program code and the results of 
intermediate computation$ are available on request. ` 

2? Ten may seem like a large number, but Section I's 
analysis shows that to explain VHBB’s result the model 
must describe the dynamics of a heterogeneous population 
of learning rules. Our estimates strongly suggest that no 
simpler model can do this, and if tighter estimates are de- 
sired further experimental data can be produced at will. 





























aż, is only weakly identified, because 
when the p;, are unobserved the only 
able consequence of an increase in the 
ance is an increase in the variance 
practice the estimated 07, ‘‘blows up” p 
when n/m = 0.5, as in VHBB’s deși 
makes the fit in the sample selection 
(26) approximately independent of the 
mated ĝo and e, which can then be chosg 
maximize the rest of the likelihood 
We deal with this problem by estimati 
model under the normalization o7, = 1,4 
is arbitrary but unavoidably so when the 
unobserved. Fortunately, as just explain 
level of the o7, (as opposed to their time tj 
Ay, which is precisely estimated) has lit 
fect on the model’s implications for g 
able variables. 
Table 2 reports the estimated par 
with asymptotic standard errors in pan 
ses. The estimates confirm that the į 
gives an adequate econometric descriptio s $ 
VHBB’s subjects’ behavior. With the ent RIS 





timated adjustment coefficients ô and ê 
plausible, though the latter, at 0.93, is ma 
higher than the 0.58 estimate for the analog# 
parameter in the analogous treatment witht 
auctions (Crawford, 1995 Table I) and %6 
what higher than our prior. The estimat 
are positive and quite large, also somes 
higher than our prior; together with ow Ñ 
mate of 6 they imply a large forward induc i 
effect, raising efforts nearly half a unit 
the value suggested by q, and prior beli 
average. The estimated £, are negative. 
pected, small (recalling that bids are 
like efforts) and insignificantly different 
0. The estimated o2, are significantly pow 
so that constraints ruling out strategie ! 
tainty are strongly rejected. The estimated 


28 ),. is naturally positive because it represen": 
of decline of B, and y, which should approach Pg 
time as strategic uncertainty is eliminated by ©) gai 
Recall that a, has been normalized to zero, SO ™ <2 gi 
y: include any exogenous trends in beliefs that 
players’ bids (6,) and/or efforts (-,). ; 





























fficient p is significantly positive, 
because Gy enters both 7; and Vin 
T encitive Weights. Finally, the estimated hy 
nificantly positive, confirming that there 
substantial learning effects, and greater 
Dats one (though insignificantly ‘sO), so that 
Be itign 2’s variance conditions for con- 
eis are satisfied.” X ; 
"wWe-estimated the model’s implications for 
We ee probability distributions of the dy- 
amics and limiting outcomes by repeated 
lation, as described in Section II, subsec- 
: gon F, taking the discreteness and bounded- 
“ass of efforts fully into account. Table 3 and 
A Figure 1 report the actual and predicted mean 
E<alues of q, and y, and Table 4 reports the 
dicted distributions of y, in the second lines 
k af each cell, with the actual frequency distri- 
F tations in the first lines for comparison. Figure 
É -1 Mustrates the dynamic interplay between q, 
i; aed y, that underlies the efficiency-enhancing 
ffect, in which q, exerts upward pressure on 
a distributions of the x, and y, via forward 
¥-zaduction, y, then exerts upward pressure on 
a distributions of the x; , the p;,.,, and 
È z1 and so on.” 
BÈ: The actual and predicted means and distri- 
Meee ons seem very close, with the single ex- 
ilon of the distribution of yı. Their 
ieeness is difficult to judge by eye, because 
a Ex frequency distributions for eight trials must 
$ coarser than the predicted distributions. 
t However, x’ tests of goodness of fit, con- 
ected for the distribution of y, each period, 
f come close to rejecting the hypothesis 
¢ actual frequencies were drawn from 


licted distributions.*’ The relevant x° 


> 





si Ar < 0, Proposition 2’s conditions on 8, and 
EES As explained in Section II, subsection C, 
} ~~ occurs despite these violations because, the, 


Upward Pressure on the x, and y, from q, comes 
= en), estimated Y: > 0. 4,4; is not generally 
Fon k use the estimated 8, < 0 and e < 1. 

ge are not independent across periods be- 
dependence; we offer them only as a 


how closely the model reproduces the 
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statistics, each with six degrees of freedom, are 
0.854 with p-value 0.991 in period 0; 5.999 
with p-value 0.423 in period 1; 1.826 with p- 
value 0.935 in period 4; and 0.350, 0.057, 
0.048, and 0.008 in periods 2, 3, 5, and 6. 


‘These results suggest that the model closely 


reproduces the dynamics of subjects’ 
interactions. 


IV. Conclusion 


In this paper we propose, estimate, and an- 
alyze a model of the learning dynamics in a 
recent experiment by Van Huyck et al. (1993), 
whose subjects repeatedly played a coordina- 
tion game with seven Pareto-ranked equilibria, 
with the right to play auctioned each period in 
a larger group. The auctions had a striking 
efficiency-enhancing effect, in that subjects in- 
variably bid the market-clearing price to a 
level recoverable only in the efficient equilib- 
rium and then converged to that equilibrium, 
although subjects who played the same coor- 
dination game without auctions always 
converged to inefficient equilibria. The 
efficiency-enhancing effect of auctioning the 
right to play suggests a new and potentially 
important way in which competition may pro- 
mote efficiency. And its effectiveness in fo- 
cusing subjects’? expectations on desirable 
equilibria may eventually help to unify our un- 
derstanding of preplay communication, via 
““cheap talk’ as well as costly signaling. 

VHBB’s contribution is a good example of 
how carefully designed and conducted exper- 
iments can change the way we think about im- 
portant economic problems. Although it 
conveys a powerful impression by itself, its 
power to inform analysis can only be fully re- 
alized by a theory that identifies the mecha- 
nism behind the efficiency-enhancing effect 
and provides a firm basis for generalization to 
other laboratory or field environments. Work- 
ing with a model that is flexible enough to nest 


` altérnative explanations, as in our earlier anal- 


yses of equilibrium selection in VHBB’s 
(1990, 1991) experiments in Broseta (1993a, 
1993b) and Crawford (1995), we find that ex- 
plaining VHBB’s (1993) result requires a 
model of stochastic, history-dependent learn- 
ing dynamics, in which interactions between 
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nonnegligible amounts of strategic uncertainty 


and the learning dynamics have a persistent 
effect on the efficiency of the limiting out- 
come. Our analysis quantifies this effect, re- 
lating it to the treatment variables (the 
numbers of players in the coordination game 
and the auction, and the order statistic that de- 
termines the robustness of desirable equilibria) 
in a way that makes it possible to assess its 
likely magnitude in other environments. We 
find that auctions can be expected to enhance 
efficiency to some extent in a wide range of 
environments that share the basic ingredients 
of the mechanism we identify —our optimistic 
subjects, forward induction, and robustness ef- 
fects. However, in other environments this ef- 
fect may be too small to assure fully efficient 
coordination. 

Our methodology offers some more general 
insights about the analysis of learning and 
equilibrium selection. Because the extent of 
strategic uncertainty cannot usefully be ex- 
plained by theory alone, its persistent effect on 
the limiting outcome gives the analysis an em- 
pirical component, which we describe via 
exogenous behavioral parameters in our spec- 
ification of players’ learning rules. Much of 
our analysis is independent of the precise val- 
ues of these parameters, but they do affect the 
quantitative results. We close the model, when 
necessary, by estimating them using the ex- 
perimental data. This dependence on empirical 
parameters and the need to analyze stochastic, 
history-dependent learning dynamics are 
likely to be encountered in realistic models of 
equilibrium selection in other environments. 
Although our analysis does not immediately 
generalize beyond VHBB’s environments, we 
hope it shows that such analyses need not be 
intractable, and that the theoretical and empir- 
ical understanding needed to address eco- 
nomic questions involving equilibrium 
selection and coordination may be achievable 
in other settings. 


APPENDIX 
PROOF OF PROPOSITION 0: 


Under the stated conditions Ny = 0, = Ge = 
0 for t = T, .... (11) reduces to 















-Because 6,, &41, 2(Xin «+» » Xm) and k 
Xm Xin «++ 5 Xm) are the same for all pi 
and 0 = (1 — €,4,)1 - ô) <1, (Al) 
the order of the x, (weakly) unchange, 
time. It follows that there exist Players; 
k, such that g(X11, ... 5 Xm) = Xj, and hgs 
Xinti Xin s Xm) = Xg for all t = T, 
then implies 


(A2) Xiri = Ô; + e411 — 6)%;, 
Laelia te 
= [1 — &4ı(l = 6) Xj 


+ &4:(1 — 6) 
and 


(A3) Xuri = 6,Xj1 + €,41(1 ze bia 


a“ 


+ (1 = e) — i)i 


= 6%, + (1 — 6%. 


X, and Xy converge to a common value 
between Xr and %;7, which is completely. 
termined by ¥r, Xr, the ôn and the e, Iti. 
from (A1) and (8) that the other X, and 
verge to the same value, and from ( 
(11), (A2), and (A3) that if T = 0, % #? 
Qo for all i and t = 0,.... i 


PROOF OF PROPOSITION 1: 

As in Crawford (1995 Proposition. 
proof is immediate by induction onc; 
lution has been found. Here we Bi 
method of constructing the solution, %, 
informative. Substituting from (1); I 
(6)-(9) yields i 



















z= Mäin os Xm Pin +» Pm) 
— hir- 1s s Xm- 03 
Mi lOO apud o Pm- 1) 
a, + b+ Y 


+ 6g( — Exu- + Chet Min., 
(1 — &)%mt—1 + Gar + Nm) 
+A — 6) — Exu- 
+ (1 — Êb + Oin a, 
(1 — NL — E)Xm -1 
+ (1 — dbm + Omn Pits «+ 1 ml 


ae (l ai EDhlxr- i sey Xm — 45 


Dit—\> «+ > Pm- i]. 

iš clear from (12) that z; is the idiosyncratic 

mm éaaponent of x;,, from (9) and (15) that Z; is 
k: & idiosyncratic component of Xp, and from 

E 7) shat Z, + ny is the idiosyncratic component 


ff pa The second part of (15) is immediate 


p-wiachi yields the first part of (15) by successive 
f sebstitution. Using (2), (4)-(5), (18), and 
{A4) can be rewritten 





(AT) xr- Yi = Xa — ACK py c > Xm Prev os 
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Summing (A6) and substituting from (14) © 
yields (13). To derive (12) from (13), note 
‘that 


> Pm) 


= Za — hin o p mts Zu F Min e s Zm E Nw) 


= Zu T h,. 


This completes the proof. 


PROOF OF PROPOSITION 2: 

The proof follows the martingale con- 
vergence arguments of Nevel’son and 
Has’ minskii (1973 Theorem 2.7.3). Substitut- 
ing (1), (3), and (7)—(9) into (26)—(27) shows 
that x, can be taken as the state vector instead 
of X,, which is convenient because it is the x, 
not the x; that are directly affected by the 
bounds and by the discreteness of effort. De- 
fine the Lyapunov function V, = Bj; (xi, — x)’, 
where the summation is taken over all i, j = 
1, ..., m.” Clearly, V, = 0 for all x,, with V, = 
O if and only if x, = x, for all i and j. Substi- 
tuting from (27) and simplifying yields 


(A8) V, = x [a oe SCL OM Xie — 1 — Xj D 


+ a one AA my Ci) t Gi, ui bF 


= ZIA =) € (1 a N Ce-i = Xj) 


ij 
+201 — EN — Nur- — Xea) 
x {a p 6) Cx a Ci) + Bir z (J; 


+ {(1 ya Nbi hj Ci) + Bir ga 0y]. 


32 Nevel’son and Has’minskii assume that V, > œ as 
x,| > ©, but they use this condition only to ensure that 
solution paths are bounded; we assume boundedness 
directly. : 
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Taking expectations in (A8) conditional on 
X,- ı then yields 


(A9) E,_,(V,|x,-1) 


= (L7 eD = 5) E r-i = =)? 


+2(1 — &)X(1 — 6) 


XDE(( = ôN — Gd + Oa — 8) 


X Xir- — Xjr-1) 


+ DY E{( — 6) Gy — Cin) + On — Ou}? 


i 


The first term on the right-hand side of (A9) 
is plainly bounded below V, _, for all x, _ , out- 
side any given neighborhood of the set for 
which V,_, = 0. Without the bounds the sec- 
ond term equals 0, and the third term con- 
verges to 0 with probability 1 because the 
finiteness of EF- o of, and EF- o 0}, implies that 
(1 — 6°02, + oh > 0, which implies that 
ERI — 6)G, + 9:]° > 0 with probability 1. 
Thus, without the bounds {V,} eventually be- 
comes a nonnegative supermartingale, so that 
V, > 0 with probability 1. Because the bounds 
can never increase E[(1 — 6,6, + 6}? or [by 
(A8)] £,-,(V,{x,—;), this is also true for the 
bounded version of the {V,} process. In either 
case, it follows that for all i and j, (xi, — x) > 
O with probability 1. Using. (9), the strong law 
of large numbers, and the continuity of A(-) 
then shows that (y, — xa) and (Xr — X) > 0 
with probability 1. y,, x;,, and xą must then con- 
verge to a common limit because X, and x, can- 
not (with positive probability) return infinitely 
often to a given point at which x;, + X, or 
Xi, # X; for some i and j; and X, or x, cannot 
(with positive probability) oscillate infinitely 
often between distinct points at which x, = Xn 
Or Xi = x; for all i and j, because X{_» a,, 
2 =0 bo Sio Ys Ès=0 Biss and Èi o Cis also 
converge to finite limits (the last two with 
probability 1). The convergence of p, and q, 
to the same limit follows from the continuity 
of g(-) and the facts that 8 > 0 as t> œ and 
25=0 Nis converges with probability one. 
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The International Ramifications of Tax Reforms: Supply-Si 










Economics in a Global Economy 


By ENRIQUE G. MENDOZA AND LINDA L. TESAR* 


This p paper studies tax reforms i ina dynamic model of a global economy calibrated 
to current U.S. and European tax policies. World capital markets add 
consumption-smoothing and income-redistribution effects that alter closed- 
economy predictions. In the absence of taxes on foreign interest, welfare gains 
of eliminating U.S. income taxes are enlarged by up to 34 percent, at the expense * 
of European losses caused by transitional declines in consumption and leisure, 
and a permanent capital outflow. In contrast, if foreign interest is taxed, the same 


tax reform reduces U.S. welfare 0.7 percent and increases European welfare 1.8 


percent. (JEL H87, H21, H23, F41) 


Up to this point I have taken no account . 
of international relations .... When this 
assumption is removed, several new and 
large problems arise ... it may be feasible 
for a man subjected to taxation in a taxed 
area to make use of an untaxed area in 
such a way as to reduce the fiscal burden 
imposed upon him .... New and so far 
unexamined dangers are threatened. It is 
clearly important to gauge, so far as we 
can, the scope and range of these in the 
particular case of our own country (A. 
C. Pigou, 1947 p. 165). 


The research program on quantitative as- 
sessments of tax reforms initiated by Robert 
E. Lucas Jr.’s (1990) lecture on supply-side 
economics concluded that far-reaching tax re- 
forms, designed to eliminate savings and in- 
vestment distortions, produce large social 
welfare gains (see, for example, Robert G. 
King and Sergio T. Rebelo, 1990; Jeremy 
Greenwood and Gregory W. Huffman, 1991; 
Thomas F. Cooley and Gary D. Hansen, 1992; 
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Comments and suggestions by participants -at the 1995 
NBER Summer Institute, the World Congress of the 
Econometric Society, the Canadian Macro Study Group 
Meetings, and the June 1995 Meeting of the Panel of Eco- 
nomic Advisers of the Congressional Budget Office are 
gratefully acknowledged. 


-in the process of expanding the capital's 
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V. V. Chari et al., 1994). The welfare 
are large despite the transitional cost in i 


from the lower level of a heavily tax-distonsd 
economy to the higher level of a tax-refor 
economy. Lucas estimated that replacing 
capital income tax with a higher labor 
tax would increase consumption per ca 
about 1 percent per year, and Cooley: 
Hansen showed that the increase can excel! fierd e's 
percent if a consumption tax is used insti 
Gains of this magnitude dwarf the benefit Y 
other major policy endeavors —such as ous © 
and price stabilization— and, in Ls B 
view, constitute the ‘‘largest genuinely i 
lunch’ "ever provided by N wl 
economics. 


tax system is shared by many policy m 3 
in industrial countries, particularly in 
Germany, Japan, and the United States. #8 
United States, for instance, Congress, 
the Kemp Commission on Growth al 
Reform which published a report in 1 
cusing on three major tax-reform prop 
the ‘‘universal savings allowance,” 
provides deductions for all income a 
savings, the ‘‘flat tax,” which cuts inc! 


' Recently, S. Rao Aiyagari (1995) showed 
nating the capital income tax can be subopt! 
are borrowing constraints and incomplete m 
















































Pi 


ingle 20-25 percent rate and makes 
‘enue losses with growth gains and 
indirect taxes, and the ‘‘consumption- 
‘ooly.” which Samant esr teed 
aya income tax with a value-added tax. 
oe Spite ie strong interest in radical tax re- 
the part of researchers and policy 

Pigou’s concen for analyzing the in- 
‘signal ramifications of tax reforms and 
Fying their impact has remained largely 
for over half a century. In fact, all ex- 
titative studies in the tradition of 
5 supply-side lecture assume that inter- 
Į financial markets do not exist. This is 
gesply at odds with the unprecedented glob- 
afization of capital markets that the world has 
iced recently and with the continuing 
acess of trade integration. The emergence of 

. #2 and sophisticated global markets brings 
ter stage Pigou’s concern for under- 
sanding how tax-reform assessments vary 
bea the public has access to world trade. This 
# the central theme of this paper. In particular, 
(ag paper undertakes a quantitative examina- 
“ef the effects of tax reforms from the 
tive of a two-country dynamic macro- 
fic model with fully integrated capital 
sad goods markets, and calibrated to reflect 
$ Æt tax policies of the United States and 


$ i Werld financial markets play a key role in 
-positive and normative aspects of tax 
f8.:Without access to external borrow- 
households finance the accumulation of 
tal during the tax-reform transition by sac- 
Consumption and leisure, and hence 
Reser re large social costs associated with tran- 
onal amics. In contrast, world financial 
Provide mechanisms for sharing the 
benefits of tax reforms interna- 
rand hence produce sizable global 
S ‘along the lines examined in the an- 
Work of Jacob A. Frenkel and Assaf 
(1992) and Greenwood and Kent P. 
h (1985). 
_MWanttative analysis shows that trade- 
nancial markets magnifies the ben- 
forms. The net welfare gain of a 
teplacing the capital income tax 
mption tax in the United States is 
an the Open-economy model, 
€rcent larger than in a closed- 


Kat 
95) showed Deg 
be suboptrsal 
pmplete males 
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economy setting. A similar reform eliminating 
the labor income tax produces a welfare gain 
10 percent larger for an open economy than 
for a closed economy. In both cases, the con- 
sumption tax increases sharply in order to sat- 
isfy the government’s intertemporal budget 


constraint, keeping constant the-levels of gov-- 


ernment expenditures and welfare payments. 
Most significantly, the transition paths for con- 
sumption and work effort change dramatically 
in the open economy. These calculations as- 
sume no change in the long-run growth rate of 
the economy, so the results do not rely on a 
permanently higher growth rate. 

The international transmission of tax re- 
forms operates through two key effects. The 
first is a smoothing effect reflected in external 
borrowing by U.S. households to smooth in- 
tertemporally the sacrifice of consumption and 
leisure implicit in the cost of the transition. As 
a result, the United States runs sizable trade 
and fiscal deficits in the short run and the cost 
of transitional dynamics falls sharply, from 7.6 
percent in a closed economy to 3.4 percent in 
an open economy, when the capital income is 
replaced with a consumption tax. The second 
effect is a long-run income-redistribution ef- 
fect. This effect captures the notion that the 
debt accumulated during the transition is serv- 
iced by a larger trade surplus in the long run. 
This mechanism transfers part of the long-run 
gains of the tax reform abroad. Hence the util- 
ity gain measured by comparing pre- and 
post-tax-reform steady states, ignoring the 
costs of transitional dynamics, is smaller in an 
open economy than in a closed economy. 

The smoothing and income-redistribution 
effects produce international spillovers of do- 
mestic tax reforms, which affect the dynamics 
of foreign borrowing and the world interest 
rate, and hence cause large global externalities 
in response to unilateral tax-policy choices. In 
fact, the additional gains accruing to U.S. 
residents by borrowing internationally are 
matched by welfare losses in Europe, so tax 


reforms -in an open economy are not ‘‘a gen-. ... 


uine free lunch,’’ as Lucas (1990) concluded. 
In the short run, the smoothing effect induces 
European households to reoptimize their port- 
folios from physical capital into international 
bonds, leading to a large capital outflow, and 
to consume less and work harder in order to 
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generate the trade surplus to match the U.S. 
deficit. These movements affect adversely Eu- 
rope’s welfare, and are only partially offset by 
the long-run income-redistribution effect. The 
latter leads to a long-run increase in European 
consumption and leisure, although Europe’s 
capital stock falls permanently. These exter- 
.. nalities thus provide incentives for strategic 
behavior by fiscal authorities similar to those 
that motivated the large theoretical literature 
on world tax competition (see Torsten Persson 
and Guido Tabellini, 1995), but to date no at- 
tempts have been made to quantify their 
magnitude. 

We also examine how the international 
ramifications of tax reforms depend on the 
general structure of tax policies. Three im- 
portant lessons emerge from this analysis. 
First, taxes on foreign interest income alter 
the incentives for external borrowing by add- 
ing a new margin of distortion between the 
intertemporal marginal rate of substitution in 
consumption and the world real interest rate. 
As a result, the distribution of welfare gains 
of tax reforms across countries can change 
dramatically. A tax reform that eliminates the 
U.S. domestic capital income tax, leaving in 
place a tax on foreign interest income, causes 
a 0.7-percent welfare loss in the United States 


and a 1.7-percent welfare gain in Europe.’ 


Second, a tax reform in the United States low- 
ers European tax revenue, and hence may 
force Europe to increase distortionary tax 
rates in order to maintain intertemporal fiscal 
balance. In the best-case scenario in which 
the European policy response is to increase 
the consumption tax, the European welfare 
loss induced by the abolition of the U.S. cap- 
ital income tax is doubled. Third, despite 
these large global externalities, it is possible 
to identify simple worldwide tax reforms that 
produce Pareto improvements. For example, 
the United States and Europe can obtain siz- 
able welfare gains if they jointly undertake 
the elimination of the capital income tax in 
favor of higher consumption taxes. 

. » The paper proceeds.as follows. Section I de- 
scribes the model and discusses the numerical 
solution method. Section II conducts tax-reform 
experiments, and explores the implications of al- 
ternative tax-policy scenarios and changes in key 
parameters. Section III concludes. 


Classical, two-country . model.. Both co 




























I. A Dynamic Macroeconomic Mode 
International Tax-Policy Analysis. 


A. Households, Firms, the Public Sect 
and Financial Markets 


The analytical framework is a dynami¢’ 


produce a single composite commodi 
trade both this good and real one- 
bonds, issued by the private sector in egg: ; 
country, in perfectly competitive intemati 
markets. The description of the model is} 
on home-country decisions and, wh 
needed, foreign-country decisions are iiti 
duced using asterisks to denote fore 
variables. -ag 
The long-run rate of output growth işi 
ogenous at a rate y, which is commo 
ross countries and across expenditure fip 
within each country, and is driven by labs: 
augmenting technological change. This 
sumption restricts the permissible s 
functional forms for preferences and tec 
ogy to the class of functions that supports b; 
anced growth. The specification of the 
is simplified by transforming all variabl 
cept employment and leisure, into station $ 
variables by dividing them through by the 
of technological progress. The transfo 
variables are written in lower case. This 
trending method also requires well-knownf, 
definitions of the subjective discount fat: 
and laws of motion for asset accumul 
The paper focuses, without loss of gene 
on the competitive equilibrium of th 
trended model. Pe 
Each country is inhabited by identical 
finitely lived individuals. The represen 
household in the home country m F 
conventional isoelastic lifetime utility func 
over intertemporal sequences of consum}: 
(c,) and leisure (L,): 
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e [el 
o pete, 
= 


t=0 


o>], 


The stationary transformation of the modd. ; 
quires that 6 be defined as 6 = B(1 +t Ts 
where B is the true subjective discount” 
The household maximizes (1) subject? 
following sequence of budget constat: 
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; iyen relative prices and fiscal policy 
gisbles, for each date t = 0, ... , %: 


-atre+ +y) 


a>. [Kr +1 + Rb, +) + Rid, +1] 


ynami¢ 

ith counts =(1- TWN, 

imodity ag 

One -enagy HUHA- TA- Ok 


— U(k,, x) + b+ de 


ms ar 4 old’s expenditures, which include pur- 
note fi chases of consumption goods, inclusive of a 

4 “Sales tax, Te capital goods, k, + 1, private inter- 
growth IeM.. eational bonds, b,,,, and domestic govern- 
commonts ment bonds, d,,,. For simplicity, bonds are 
nditure flow resented as discounted bonds, so the real 
ven by. MSp -retums of private and public bonds are (1/R,) — 
nge. Thig and (1/R*) — 1, respectively: The right-hand 
ssible ‘s¢ side of (2) is the household’ s disposable factor 
sandt . gid nonfactor income. Factor income is de- 
t support > gyed from supplying capital (k,) and labor (N,) 
1 of the mo @ firms at pre-tax rental rates w, and r,, and 
| varjablés Wi. ied at rates 7, and 7,, with a provision for a 
into statio - depreciation allowance. Installed capital de- 
igh by the - preglates at a fixed rate 6, and additions to in- 
e transfora. -<alled capital incur capital-adjustment costs 
case. Thisi@® , where x, is net investment (1 + y)k,+ı — 
well-known, kı These costs, or similar frictions like 
discount: {a n lags, are required in dynamic open- 


momy models to differentiate physical from 
lal assets, and thereby prevent the in- 
ous adjustment of the domestic mar- 
‘Product of capital to the world interest 
še Mendoza, 1991). Without adjustment 
these models cannot produce transi- 
‘ nies and predict unrealistically 
vs gS in investment rates and current 
> scent balances. The last two terms in (2) 
t nonfactor income derived from pub- 
aid private bonds. This income is not taxed, 
WE Show later that allowing for taxation of 
terest income has important impli- 
Households also face a no-Ponzi- 


Lof the m rn She ction, Limy ~- (II oR )br = 0, 
= B(1 AIEE ca o E oer with (2) implies that the pres- 
s discount! £ £ household disposable factor in- 


als that of expenditures plus an 
holdings bo. j 4 


Implicit in equation (2) is the assumption 
that domestic capital and public debt are 
owned only by. domestic households. This is 
an extreme assumption, but it has the advan- 
tage that it allows the model to support com- 


__petitive equilibria,in. which there is free 


international trade in private bonds and differ- 
ing capital income tax rates on residents of the 
two countries. We explain below that this is 
not possible with cross-border trading on eq- 
uity or public debt (see also Frenkel et al., 
1992). 

The household’s constraint on the allocation 
of time between labor and leisure is: 


(3) L,+N,=1 


where the total number of hours is normalized 
to one. Labor is immobile across countries. 
Firms maximize profits subject to constant- 
returns-to-scale technological constraints, tak- 
ing as given factor prices. Thus, firms employ 
inputs according to marginal productivity . 
rules and earn zero profits in equilibrium. The 
production function is Cobb-Douglas: 


(4) F(k,, N) = kt > °NF. 


Fiscal policy is represented by an intertem- 
poral sequence of unproductive government 
expenditures, g, for t = 0, ..., ©, and a set of 
tax rates. The date-t government budget con- 
straint is: 


(5) &: + d, = Tr, — 6)k, + TWN, 


+ Tec, + (1 + y)Ridpa1- 


The left-hand side of (5) represents uses of 
government revenue (i.e., goods purchases and 
debt payments). The right-hand side includes 
tax revenue and newly issued debt. Since gov- 
ernment purchases and tax rates are exogenous 


policy choices, the, government is assumed, to . _. 


issue new debt as needed to satisfy its budget 
constraint. Government also faces a no-Ponzi- 
game constraint, Limy -œ (Il =oR$)dr = 0, 
which jointly with (5) implies that the present 
value of government expenditures equais the 
present value of tax revenue plus the initial 
stock of public debt dy. Moreover, (2), (5),.and 
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the no-Ponzi-game constraints imply that the 
present value of the trade balance equals bo. 

Public debt in this model is ‘‘Ricardian’’ in 
the sense that, given dọ and policy choices on 
government purchases and tax rates, the com- 
petitive equilibrium can be represented with 
the government debt path dictated by (5), or 
with adjustments jn lump-siiin' tratisféers to 
households, tr,, by the amount required to bal- 
ance the government budget constraint each 
period: 


(6) tr, = Tr, — Oki + TWN, + TC; — ge 


The equivalence between the intertemporal se- 
quences of debt and transfers is clear taking as 
given dọ and noting that (5) and (6) imply 
tr, = d, — (1 + y)R8d,.,. This framework 
also allows for a constant level of exogenous 
government transfers to households, repre- 
senting subsidies and welfare programs, which 
can be denoted as T and added as an extra 
right-hand-side term in (2), (5), and (6). 

The market-clearing conditions for the 
world markets of goods and bonds are: 


(T) Ft, N) + F(k#, NF) 
=c, + ck +x, + x* + Vk, x) 


+ WRF, x8) + 8, + BF, 


(8) b, + bř = 0. 


The model’s competitive equilibrium is 
given by sequences of prices [r, r*, R,, w;, 
wi*]r=o and allocations [k, +1, kf. 1, Bra, Oi 
N,, N¥, Cn CĂ, Ly, Li, tr, trž yp. that satisfy the 
first-order conditions of the optimization prob- 
lems faced by households and firms, the con- 
straints of households and governments, and 
conditions (7)—(8)—-given [ko, kf, bo, bž, do, 
dë] and the choice of fiscal instruments. 


B. The International Transmission 
of Tax Reforms ` 


The first-order conditions that characterize 
optimal decisions provide important intuition 
for understanding the international ramifica- 
tions of tax reforms. The first-order conditions 








































for investment and foreign bonds. ins 
country, ignoring adjustment costs, 


(1 + YUe L) 


©) BUC L,+1) 
= (1 — 7)(Filkia1, 11) ~= ô) + 
= R;', 

go CEPU LF) 


BU (cH. 1, L1) 


= (1 — TF CK 1, Ne) - 
ae =R; 

The model supports equilibria where ome ak 
tries trade private bonds and 7, + tR TE 
follows from two implications of condj 
(9)—(10). First, trade in bonds impli 

countries face a common Toe 


Second, the optimal cee allocation sna $ 
capital and private bonds requires that & 
post-tax net marginal products of capital & 
also equalized across countries. As a 
differences in capital income taxes are ofi 
by differences in pre-tax net marginal pole 
of capital. This cannot occur if countries tii 
equity and tax capital income according tù 
residence principle (i.e., home householdsp# 
T, on their holdings of k and k*), In this: 
both pre-tax and post-tax returns.on capita 
equalized, and hence a world compels 
equilibrium requires 7, = TH. A similar r; 

applies to world trade in public debt: egi 
with 7; + 7 can be supported only i 
no world trade in public debt, as we as 
or if public debt is internationally tra 
interest payments on it are tax free. In be a 
case, however, b and d woùld be perf ye 
stitutes and there would not be well- 
portfolio shares assigned to each debt ee... 
ment. The assumption that neither é 
public debt are traded across coun 
strictive, but its has two advantages 
the model simple and it prevents the 
from being clearly at odds with the 


anh 




















































mestic capital income tax 


< in do 
a ntries that we document 


across cou 


derstand the international transmis- 
tax reforms, consider next the impli- 
of a permanent, unanticipated cut in 
3 z country. capital income tax.. Con- 
- i gag (9)- (10) imply that, for a given 

d iq interest rate R;' — 1, the tax cut in- 
the net-of-tax domestic marginal 
‘of capital, and hence arbitrage with 
d market implies that kısı must rise 
: equilibrium. There is no direct 
ge”? effect on foreign capital, since 
‘not traded globally, but to the ex- 
‘hat domestic households borrow from 
ito finance the increase in k,,, they 
capital outflow from the foreign 








whe Moreover, since we are dealing 
H Bo o large countries, when one country 
of condit es its net foreign debt it also alters the 
implies interest rate. Interest rate changes are 


iempötary, however, because the long-run 
‘iatefést rate r is pinned down by the steady- 
‘gata condition r = p — yo, where p is the 
‘time preference, defined as p = 


@ above intuition implies that transitional 
‘gad long-run changes in international borrow- 


;. As a and transitional changes in the interest 
xes are oe i are'the channels for global transmission 
rginal piv fteforms in the model. We condense 


mm 6 the resources that a county obtains 
area 

soi | frdea World markets to lessen the cost of the 
S on capil al dynamics in the initial stages of a 
d compe l'and can be measured by transi- 
ges in net exports — which combine 

debt est rate changes and changes in net 
| only ifthe é et positions. The second effect re- 
as we a # redistribution of income across coun- 
nally trades Occurs because a country that 
free. In alates foreign debt during the transi- 


lamics maintains a long-run trade 
Service that debt. Since.the long-run 
fale is invariant to tax changes, this 
€ct captures only long-run changes 


ither setae 
aes asset positions. , 
antages? t an implication of the above- 
pa effects is that the benefits that a tax- 


untry extracts from world capital 
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markets depend on how its borrowing deci- 
sions affect R,. In the extreme case of a small 
open economy, without adjustment costs, R is 
constant and a cut in 7, is matched by an im- 
mediate and large increase in x, entirely fi- 
nanced by external borrowing. A small 
country will borrow more, and at a lower cost, .. 
than a large country because for the small 
country the world supply of capital is infinitely 
elastic. Neh 

Labor income taxes and consumption taxes 
also have international implications, but they 
are less direct. Changes in these tax rates op- 
erate first through the consumption-leisure 
trade-off: 


Ucn L) _ (= Ta) 


K et) Otro” 


Given Cobb-Douglas production technologies, 
the resulting distortions on labor supply affect 
the marginal products of both capital and la- 
bor, and the effect on the former triggers the 
international transmission mechanisms de- 
scribed earlier. Note also that 7,, and 7, jointly 
distort the marginal rate of substitution be- 
tween consumption and leisure, but their im- 
pact on tax revenue, household income, and 
welfare differs, as shown below. 


C. Income Tax Reforms: Calibration and 
Solution Method 


We study tax reforms in which the govern- 
ment undertakes a permanent, unanticipated 
reduction in time-invariant factor income taxes 
at t = 0.2 Government expenditures and ex- 
ogenous welfare transfers remain fixed at the 
levels go and T, respectively. The revenue lost 
due to income tax cuts is replaced by increas- 
ing T, so that the present value of tax revenue 
equals that of government expenses (assum- 
ing, without loss of generality, that dọ = 0). 
The government adjusts tr, (or issues debt) 
along the transition path as needed to make up 


- wi ba 


2 We follow Lucas (1990) and Cooley and Hansen 
(1992) in limiting the analysis to changes in time-invariant 
tax rates. Cooley and Hansen provide evidence suggesting 
that the extra gains made with time-variant taxes relative 
to time-invariant taxes can be small. 
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for any shortfall or excess of tax revenue over 
expenses. Following Lucas (1987, 1990), the 
net welfare effect of the reforms is measured 
as the constant percentage increase in c,, for 
t = 0, ... , ©, that leaves households indifferent 
between the lifetime utility obtained by re- 
_ maining in the pre-reform equilibrium, and the 
lifetime utility obtained by undertaking the tax 
reform, inclusive of the transitional dynamics 
of c, and L,. This net gain is also decomposed 
into a long-run gain, measured by comparing 
lifetime utility across pre- and post-tax-reform 
steady states, and the short-run cost of the tran- 
sitional dynamics. 

Numerical solutions of the tax-reform ex- 

periments involve the computation of: (a) 
long-run, balanced-growth equilibria before 
and after the tax reform, and (b) transitional 
dynamics between pre- and post-tax-reform 
steady states. The computation of the pre- 
reform equilibrium is based on a calibration 
exercise similar to those undertaken in closed- 
economy studies. In contrast, the computation 
of the post-tax-reform, balanced-growth equi- 
librium and the transitional dynamics differs 
markedly from closed-economy studies be- 
cause in the open economy the two must be 
solved simultaneously. This is because, while 
closed-economy models feature explicit 
steady-state solutions invariant to initial con- 
ditions, in open-economy models there are no 
explicit steady-state solutions for the alloca- 
tions of consumption and private bonds across 
countries, and the dynamics and post-tax- 
reform steady state of b vary with initial con- 
ditions. David Lipton et al. (1982) and Lipton 
and Jeffrey Sachs (1983) examined similar 
cases in which steady-state foreign asset po- 
sitions are part of a two-point boundary prob- 
lem in the context of dynamic, open-economy 
IS-LM models. 
Calibration of the Pre-tax-Reform Equilib- 
rium.—The following conditions summarize 
the long-run equilibrium of the home country 
along the balanced-growth path: 


ee eR eer ee 
0). Sn ATA 
C (ee) ell Od aol 
(13) E: 
y y 


MAR] 
x tb 
a} £=1-*-8-, 
y yoy 7 
7 
1-T, 
(15 joa 
= D Ta Cc 1 Ti 
a> a 
y Lor ae 










































Condition (12) is the steady-state version 
the Euler equation for capital, and expres 
the capital-GDP ratio, k/y, as a function's 
preference and technology parameters ang 
Equation (13) is the law of motion for c 
accumulation, and determines the steady. 
investment rate, x/y, as a function of y, ô, 
k/y. Condition (14) uses the budget constra 
to define the consumption-output ratio, cly 
a function of x/y and the GDP shares of 
ernment purchases and net exports (gly ad Si 
tbly, respectively). Since along the balance: SHEE 
growth path tb/y = (6 — 1)b/y, the private dt $ 
ratio b/y is a simple transformation of iy $ 
Condition (15) follows from (11) and sea’ § 
as a function of c/y; Tn» Te, œ, and a. In pif 
aration for the analysis of Section II, note Gi ; 
(15) also determines N,’at any date in the ae 
librium path, with c/y replaced by c/y, Na 
also that 7, affects both x/y and, through $ | 
effect on c/y, the supply of labor, while nä A 
T. do not affect x/y. “hs 
Formally, equations (12)—(15) are an ee 
deridentified system of four equations va | 
five unknowns (k/y, x/y, cly, tbly, and N 
Equations (12)—(13) are block recursive 
determine k/y and x/y exactly as in a clo 
economy model, but (14)—(15) cannot 
mine c/y, tb/y, and N. Thus, balanced-g% 
equilibria of these variables are nO 
down by steady-state conditions. The £ 
tion of the pre-tax-reform equilibrium“ 7 
vents this problem by taking tb/y fongi fa 
data. More precisely, the system (12)- se 
solved for ô, 2, a, and c/y, given the V8" SEN 
other preference and technology P 
tax rates, and long-run averages ° 
tbly, g/y, and N taken from actual dale: at 
home country is calibrated to U.S. wr 
the foreign country corresponds to Eut ao 
aggregates measured as arithmetic ave "3 Sea 
data for France, Germany, Italy: an 





TABLE 1-—PARAMETER VALUES AND PRE-TAX-REFORM, STEADY-STATE ALLOCATIONS 


6  ———————————————————————— eee 
Tg ee ee a e a ia eo 
Parameter values: 


Technology and preferences: 





ô a Yy n B o a 
0.0161 0.64 0.0039 10 0.993 2 2.675 
Fiscal policy parameters (in percent) 

United States Europe 
Ti 41.5 34.3 
Ta 29.1 38.2 
7 44 . 15.8 
gly” 19.0 21.0 

Pre-tax-reform, balanced-growth allocations (GDP ratios): i 
United States Europe 

Data Model Data Model 
cly 0.65 0.65 0.60 0.59 
xly® 0.17 0.17 0.17 0.18 
tbly —0.01 —0.01 0.01 0.01 
Tax revenue 0.28 * 0.30 0.36 0.41 


Net transfers‘ 0.14 0.12 0.24 0.20 


Notes: Figures in ‘‘Data’’ columns are averages for the 1968—1991 period, based on na- 
tional accounts and tax revenue data from OECD National Accounts and Revenue Statistics. 
* Tax rates are 1991 estimates computed as in Mendoza et al. (1994). 
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level. 


Kingdom. 


period. 









parameter values used to calibrate the 
g el at a quarterly frequency are shown in 
ae The values of g/y and g/y* are easily 
“Sued frdm the data, and are estimated at 19 
y Sent for the United States and 21 percent 
“Tope, including public investment. In 
oe macroeconomic estimates 
me -tates 1s difficult due to complex inter- 
P Séional differences in tax EA (credits, 
ons, deductions, etc.) and to the pro- 
and nonlinearity of tax schedules. In 
work with Assaf Razin (Mendoza et 


° Government expenditures (including public investment) at the general government 


° Private investment rate. Data not available for Italy. The figure shown for Europe is 
the average of the private investment-GDP ratio in France, Germany, and the United 


“ Subsidies and all current transfers. Data for Italy and the UK are for the 1980-1988 


al., 1994), we estimated tax rates for Europe 
and the United States over the 1968—1990 pe- 
riod by combining detailed tax revenue statis- 
tics with information from the aggregate 
balance sheets of households, corporations, 
and government from national income ac- 
counts. Figure 1 plots the estimated tax rates. 
We set the pre-tax-reform tax rates equal to 
1990 values. The estimates suggest that 7, is 
larger in the United States than in Europe (41.5 
percent compared to 34.3 percent) and, con- 
versely, 7, and 7, are larger in Europe than in 
the United States (15.8 versus 4.4 percent for 
the consumption tax and 38.2 versus 29.1 per- 
cent for the labor income tax). These tax rates 
capture the widely accepted view that, 
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Ficure 1. ErFecrive Tax RATES IN EUROPE AND THE UNITED STATES 


compared to Europe, the United States taxes 
consumption and labor income less heavily 
than capital income. 

The’*rest of the calibration follows stan- 
dard practice in real-business-cycle analysis 
(see Chs. 1 and 7 of Cooley, 1995). The per 
capita GDP growth rate is set to y = 1.56 
percent per annum (0.39 percent per quarter) 
and the intertemporal elasticity of substitu- 
tion is set at '/, (i.e., o = 2). We measured the 
annual x/y ratio, adjusted to exclude public in- 
vestment, at 0.17, which is similar to the es- 
timate Cooley and Hansen (1992) obtained for 
the post-war ratio of fixed nonresidential in- 
vestment relative to corporate GDP. The quar- 
terly k/y ratio is set to 2.16 (8.62 annually), 
which is also in line with the figure Cooley 
and Hansen used (2.13). We also followed 


















these authors in setting the labor share, 4 
0.64. Given these ratios and parameter vali 
and the tax rates, conditions (14) and (15 
ply values of ô equal to 1.61 and Be 
0.99 (where 6 = B(1 + y)'~ °). The 
value of r is 6.1 percent per annum. } 
x/y and g/y, conditions (14) and (15)j 
determine a value of a consistent with 8 
allocation equal to 20 percent of timè, oe 
ratio c/y consistent with tb/y = —1 
both conforming to U.S. data. This Í 
a = 2.675 and c/y = 0.65. Preference and 
nology parameters are set identical a A 
United States and Europe to highlight te, 
fects of asymmetries in fiscal Gr 
adopt the same values of a, a, £, 4 
for Europe and allow k/y*, x/y*, an 
adjust accordingly. 








ue ` J MENDOZA AND TESAR: SUPPLY-SIDE ECONOMICS IN A GLOBAL ECONOMY 235 


j.also reports the measures of c/y, 
fy and the GDP ratios of tax revenue 
-ovement transfers for both coun- 
and in the model. The calibra- 
‘set to mimic the U.S. ratios g/y, x/y, 
„but all other ratios are endogenous 
d oe of the steady-state system of the 
>Hence, the fact that the model’s ratios 
” aty in line with the data suggests that 
„reform equilibrium is a reasonable 

n for tax-reform analysis. Note that in 
tax revenue exceeds government pur- 
> over 10 percentage points of GDP, 
because net government transfers to 
olds amount to 14 and 24 percent of 
a the United States and Europe, respec- 
ese transfers measure subsidies and 
sats on account of large welfare, health- 
k and other entitlement programs. As 
zggiained, the model captures this fact by al- 
iwing for an exogenous rebate of a part of tax 
gevenue that is kept constant at the pre-tax- 
wfaim level in the tax-reform experiments. 
 Coasequently, the transitional fluctuations in 
get transfers computed below correspond ex- 
actly to adjustments in the stock of public debt. 
[ke model also features capital adjustment 


































2 
Vk, xX) = 5 ( _ 2) k, 


= x/k, so that adjustment costs are zero 
sadly state. The value of 7 is set so that the 
fage convergence rate of GDP to the long- 
ren, '8 lanced-growth path corresponds to 
pirical estimates that set conditional 
convergence coefficients at about 2-3 
(see Robert J. Barro and Xavier Sala- 
s.1995).* This implies 7 = 10. Barro 
i-Martin (1995) noted that this ap- 
ee Can yield values of 7 that exceed em- 
Saitel evidence from investment equations by 
Margin. However, adjustment costs can 


a. Thi terpreted as a proxy for other fric- 
arence abi mt from the model—such as the ex- 
atical actessig „a nontraded goods sector— which 
righlight eg aggregate capital accumulation. 
Jolicy. 
1, Br ds 


convergence rate is 1/T X7- o n+ i/y) 
Teaches the steady state at T. 


Mendoza and Tesar (1995) show that a two- 
country model with nontraded goods, cali- 
brated setting 7 to the value used in the study 
of international business cycles by Mendoza 
(1991), produces similar results for tax-reform 
assessments as the present model. We focused 
on the one-sector model here for expositional 
ease and examined the sensitivity of the nu- 
merical simulations to changes in the value 
of 7. 

Transitional Dynamics.—The solution of the 
transitional dynamics of a tax reform requires 
the simultaneous solution of the paths of for- 
eign debt accumulation and the net foreign as- 
set positions in the post-reform steady state. 
We propose a method that ensures ‘that post- 
reform, steady-state bond positions are con- 
sistent with debt-accumulation dynamics, 
starting from initial conditions corresponding 
to the pre-reform equilibrium. The method 
blends the King-Plosser-Rebelo (KPR) linear 
approximation algorithm with an iterative 
‘‘shooting’’ routine on foreign debt.* We take 
an initial guess of the long-run bond positions 
to which countries converge after the tax re- 
form (typically the pre-tax-reform positions), 
solve (12)—(15) for k/y, x/y, c/y, and N, and 
linearize around the implied balanced-growth 
allocations. Then we simulate the transitional 
dynamics for 2,500 periods using KPR and 
setting as initial conditions the pre-reform val- 
ues of the state variables kp, k*, and bo. The 
simulations produce a path of foreign debt dy- 


- namics that converges to some long-run bond 


positions. If these differ from the initial guess, 
the new results are adopted as a new guess and 
the process is repeated. The process converges 
rapidly, in four or five iterations, but the dif- 
ference between the initial guess and the final 
outcome can be quite large. For instance, ap- 
plication of the KPR method assuming that the 
pre- and post-reform bond positions are iden- 
tical produces welfare gains for the tax re- 
forms examined in Section II about '/, smaller 


4 An alternative to the KPR method is to solve the se- 
quence of Euler equations as in Cooley and Hansen 
(1992), This is computationally expensive in models with 
a large number of state variables. Our algorithm can mimic 
closely the results of the Cooley-Hansen tax-reform 
experiments. 
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than the true values. The income-redistribution — 


effect is also underestimated by a large 
margin—the correct long-run trade balance is 
a surplus of 3.8 percent of GDP, compared to 
a deficit of 1 percent in the incorrect solution. 

The solution method requires a second 
‘‘shooting’’ routine to ensure that the inter- 
temporal government budget constraint is sat- 
isfied. The algorithm checks whether the 
present values of government expenses and tax 
revenue are equal. If this equality fails, T, is 
adjusted according to a rule that updates the 
tax rate as needed to balance the latest estimate 
of the government’s intertemporal budget con- 
straint. Note that, in the long run, the adjust- 
ment in 7, is smaller the larger the rise in c 
and the larger the rise in y relative to c (i.e., 
the larger the raise in the base of the con- 
sumption tax and the larger the ‘‘supply-side’’ 
effect enhancing income tax revenue col- 
lection). In the short run, however, tax revenue 
declines with respect to the pre-tax-reform 
level, and this requires a larger increase in 7, 
the larger the transitional declines in con- 
sumption and factor incomes. The algorithm 
starts again with the shooting routine on bonds 
using the updated T., and the process is re- 
peated until it converges to consistent solu- 
tions for both 7, and long-run bond positions. 
We assume for now that the foreign country 
uses lump-sum taxes to offset any global fiscal 
effects of the home-country tax reform. Later 
we explore the implications of assuming 7* is 
adjusted to maintain intertemporal fiscal bal- 
ance abroad. 


IJ. The International Effects of 
U.S. Income Tax Reforms 


A. Elimination of U.S. Factor Income Taxes 


The quantitative exploration begins with the 
analysis of a unilateral reduction in U.S. factor 
income taxes for both the two-country case 
and the case in which the U.S. economy is 
modelled as a closed economy. Since tax- 
reform proposals differ markedly on the allo- 
cation of tax cuts across labor and capital, we 
consider two basic tax reforms: a reform that 
replaces the capital income tax with the con- 
sumption tax, keeping the labor income tax 
constant, and a reform that replaces the labor 
































income tax with the consumption tax, leg 
constant the capital income tax. This sti 
is also intended to illustrate how the ings. 
tional implications of two major income # 
reforms differ. As we show, the global os 
overs are most relevant in the case of ca: 2 
income tax reforms, and hence much of 
subsequent aiialysis will focus On this cas 
The top part of Table 2 shows the effecn' M 
the elimination of the capital income tax, p% 
reform reduces T, to zero from its cuia 
value in the United States (41.5 percent), ¢ 
sider first the case in which the United St 
is a closed economy. The model predicts tg 
T, needs to rise to 16.3 percent to mainiy 
fiscal balance. Welfare is 9.8 percent highs 
the post-reform steady state than in the pm 
reform steady state, but the transition to $ 
new steady state implies a hefty social cos 
7.6 percent. Still, the net welfare gain of & 
reform is a sizable 2.2 percent. The impes Name 
effects show that the tax reform causes 4% REN 
fall by 8.3 percent and Lo to rise slightly y 
0.1 percentage points. The rise in the om 
sumption tax increases the relative price ofl 
bor relative to leisure, causing a substitui 
effect favoring a fall in labor supply. 0a $; 
other hand, the adverse income effect refl ae 
in the falling share of consumption relative® tl 
output at date O favors an increase in| 
supply. The two effects almost neutralize ee 
other [see (15)] and result in the small ns3 
Lo. The fall in the consumption-output ratk 
due to the increase of 7.3 percentage poll 
Xo/Yo, as the process of capital accumulati 
the transition to the new long-run equili 
begins. This process causes a temporat} 
crease in ry of 3 basis points in annual tem 
Consider now the results of the op 
economy. model. The effects of the tax ref 
are radically different in the follow; 
key dimensions. a f 
(1) Intertemporal smoothing. — The C% = 
the transition is sharply reduced by intet; 
poral smoothing through borrowing on alk 
national capital markets. On impact, thoes 


Ta 


a ve J 
i aa iian ae a Ea 


5 This estimate is comparable with the Lepea vd 
obtained by Cooley and Hansen (1992) for a S PE. i 
periment replacing 7, with 7, under a slightly 5 bie ot REN e 
parameterization. n 
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m tax: E ; : TABLE 2—MACROECONOMIC EFFECTS OF INCOME TAX REFORMS IN THE UNITED STATES 

ie y : = Closed economy Open economy 

4 inci A à . United States United States Europe 

e global ; Impact Long-run Impact T: Long-run Impact Long-run 

zase oftes : : effect ` effect effect effect effect effect 

> much 12 RET. Replacing the U.S. capital income tax with a consumption tax 

dn this'cag 

5 the € fies a Š 

come tatia ie 0.000 0.000 0.000 0.000 

m its ‘Cary k: ; 16.333 16.333 14.775 14.775 

percent) a 

; United: SE ; P 

el predicts ‘es cost coal ~3.415 —4,860 

ef i Èe A 9.805 6.309 4.036 

scent hight 2.161 2.894 —0,824 

an oe a hanges 

ansition of tage c 

y social ~0.456 13.490 -2.795 19.326 4.262 -5.037 

are g TEE > —8.343 6.991 —3.858 7.099 ~4.031 1.082 

t$ Th TE -o 0.000 52.371 0.000 60.206 0.000 -5,037 

m causes l 

rise sligh 

f — — —9.886 3.817 - 

se in th 3 9.969 4.944 

tive p 3 ` 7.260 5.899 11.205 5.899 —3.547 0.000 

ga substitua. g 0.030 0.000 0.016 0.000 0.016 0.000 

supply.'0 É i 0.144 0.778 0.867 —0.221 —1.192 0.891 

i ef fect re Replacing the U.S. labor income tax with a consumption tax 

ption rel 

crease mi . 

: neutralize ‘ 0.000 0.000 0.000 0.000 

the small a a 29.375 29.375 28.795 28.795 

n-output CaN 

‘entage po 7 

accumulates cost ~1.697 —0.696 —1.042 

nied ulia f te gain 4.835 4.148 0.791 

a tempori et chang 3.138 3.452 —0.251 

in a ; e changes 

z a s 5.921 8.760 5.390 10.045 1.031 —0.998 

: followi mc i 7.684 11.335 8.811 11.500 ~0.936 0.215 

: 0.000 8.760 0.000 10.045 0.000 —0,998 

z. —The int changes 

ced by : l = = -1.935. .-0878  . 224% 0.940 

rowing Obes i 

mpact ti", i i 0.079 0.000 0.887 0.000 —0.880 0.000 
0.006 0.000 0.004 0.000 0.004 0.000 

— 1.904 —1.774 —1.713 —2.013 ~0.286 0.177 

h the 2.3-PC umn tees za SDP domestic produce (GDP), c = consumption, k = capital stock, rb/y = trade balance-GDP ratio, x/y = 

392) for a Si ratio, r = real interest rate, and L = leisure hours. Figures are changes relative to pre-tax-reform, 

ar a slightly Sanaa n allocations at quarterly frequencies, except the interest rate, which is annualized. Initial levels in closed 


es differ slightly because the latter is calibrated to reflect a trade deficit of 1 percent of GDP. Adjusting 
has negligible effects. - 
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by almost 10 percentage points to finance the 
increase in investment and smooth consump- 
tion. The fall in co is about '/, the size of the 
decline in the closed-economy case, and ro 
rises by only half as much. The cost of the 
transitional dynamics falls from 7.6 percent to 
3.4 percent. i 
(2) Income-redistribution effect. —The 
income-redistribution effect results in a 
smaller steady-state gain in the open econ- 
omy because the United States runs a per- 
manent trade surplus of 3.8 percent of GDP 
to service its foreign debt. Still, the welfare 
gain net of transitional dynamics reaches 2.9 
percent, exceeding the 2.2 percent of the 
closed economy by about '/. 

(3) Dynamics of labor supply. —The impact 
and long-run effects on labor and leisure differ 
markedly from the closed-economy case. In 
the closed economy, Ly remains nearly con- 
stant. In contrast, international borrowing al- 
lows L to rise nearly 1 percentage point in the 
open economy. In the long run, leisure in the 
closed economy rises by 0.8 percentage points, 
while in the open economy leisure falls about 
ofa percentage point. The extra output the 
open economy needs in order to maintain the 
trade surplus that services its enlarged external 
debt implies that labor supply must be higher, 
and hence leisure smaller, when world markets 
are considered [see equations (14) and (15)]. 
(4) Adjustment in the consumption tax.—The 
higher long-run levels of labor supply and the 
capital stock in the open economy imply 
higher levels of GDP and consumption, which 
yield higher labor income and consumption 
tax revenues allowing the government to 
maintain budget: balance with a smaller in- 
crease in Te. This occurs despite larger tran- 
sitional declines in tax revenue resulting 
from the short-run falls in labor supply and 
consumption. The result is that 7. increases to 
14.8 percent in the open economy, about 1.5 
percentage points less than in the closed 
economy. 

Figure 2 plots the transitional dynamics in- 
duced by the tax reform. The series plotted are 
percent deviations from the pre-reform steady 
State for the United States, as a closed econ- 
omy (dashed line), as an open economy (solid 
line), and for Europe (dotted line). The model 
produces simulations for 2,500 quarters, but 


`` enue falls more and the fiscal deficit 
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the plots show only the first 200 quaes 3 i 
dynamics of consumption, leisure, ypc 
capital stock, GDP, and net exports re 
intuition developed above, and illus 
smooth dynamics typical of neoclassi 
els with isoelastic preferences and tech 
gies. In addition, the figure plots the d 
of tax revenue and the fiscal balance, Tay; E 
more during the transition of the open ci 
omy than in the closed economy. Interesting: 
these results show that the capital income 
reform produces large external and fiscal & 
icits that persist for over a decade. Thus y 
reform can produce the “‘twin defici ne Al 
nomenon, and rationalize large fiscal and ty, 
deficits as natural equilibrium outco 
economic reform. i 
The open-economy model also predicts ta: $g 
the U.S. capital income tax reform has mg % 
consequences abroad. These are observed} 
the transitional dynamic paths for Europe ia 
in the corresponding columns of Table 2, Te 
additional welfare gain enjoyed by US 
residents due to international borrowings 
matched by a welfare loss in Europe. Europi #9 
consumption and leisure fall on impact ii $ = 
percent and 1.2 percentage points, resp. pafs 
tively, and Europe also suffers a permati- 
decline in its capital stock (of about 5 perë 8 
in the long run) resulting from the outflow € ‘R 
financial capital into the United States: Ar $ 
though European households cannot invest 
rectly in the U.S. capital stock, they bey | 
finance the accumulation of U.S. capital F: 
shifting the composition of their savings hië 
investing in their own capital to internati 
bonds. Thus, the reduction in the U.S. capt 
income tax leads to a global reallocati 
capital via the bond market. The smoot! 
and income-redistribution effects am 
causes of these externalities, the magnitude 
which can be measured by the impact 
long-run effects of the reform on the U.S. t; 
balance and the world interest rate rev! Ey 
earlier. f 
Consider next the implications of th 
‘income tax reform, summarized in the 
panel of Table 2. The U.S. labor ince 
falls from 29.1 percent to 0 and the cob; 
tion tax increases from 4.4 percent to 
29 percent in both closed and open eco 
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nnot inves. FIGURE 2. TRANSITIONAL DYNAMICS OF A CAPITAL INCOME Tax REFORM IN THE UNITED STATES 


SX revenue and fiscal balance.are in percent of GDP. Data plotted are deviations in percent of pre-tax-reform 


i k enem -except trade balance-GDP ratio, tax revenue, and fiscal balance, which are percentage-point deviations 
r saving Pre-tax-reform equilibrium. 


o inte: 

he U.S. 

reallocallongs fexults differ from those of the capital in- Second, because both 7, and r, affect the same 
The smooy ; = reform in three important respects. decision margin (the leisure-consumption 
ffects afte eeuice neither 7, or 7, affect the long- choice), and both labor income and consump- 
e magm! gene estment rate and the intertemporal- tion are equivalent to about two-thirds of na- 
he impa\ S tee decision margin, this tax reform tional income, the elimination of T, requires a 
n the U T on the investment-output ratioin much larger increase in 7, than the elimination 

: rate Te aaa Fun, and has only small effects during ~. of z,. Third, the labor tax is more distortionary . 


mn. Accordingly, the smoothing and than the capital tax, in the sense that eliminat- 


ms he J ee stribution effects triggered by this ing the, former produces larger welfare gains 
d m ee a e acanily weaker, as can be seen in both closed and open economies. This oc- 
bor inc) - me smaller effects on the trade curs because, although the steady-state welfare 
d the Co aE, v&put ratio and the interest rate— gain of the capital tax reform is about two 


rcent E Me ee Walitatively the consequences ofin- times larger than that of the labor tax reform, 
pene -T "towing are the same as before. the cost of transitional dynamics is nearly five 
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refo 
cha 


Ta at the same level of 1.05 percent. 


b , 3 A EE ig 
This scenario assumes zero trade deficits in the pre-reform steady state and is calibrated by setting the inital ba 


Positions to 0, 


times larger. Consumption falls sharply during 
the transition of the capital tax reform while it 
increases sharply in the case of the labor tax 
reform. The opposite movements are observed 
in leisure, but given the assumed substitut- 
ability of consumption and leisure and the 
wage elasticity of the latter, households prefer 
the outcome with the sharp initial consumption 
boom associated with the labor tax reform. 


B. Key Determinants of the Global 
Spillovers of Tax Reforms 


The previous analysis showed that a unilat- 
eral reduction in factor income taxes by one 
country has significant implications world- 


Wide. This result was established in an envi- 
ronment where the United States replaced 


s: Impact effects correspond to changes in the date of the tax reform and long-run effects are changes in posta: 
rm, balanced-growth allocations, both relative to the pre-tax-reform, balanced-growth equilibrium. c is the peros. 
ange In consumption and tb/y is the change in the net exports-GDP ratio measured in percentage points. 

This scenario assumes that prior to the tax reform the home country applies the ‘‘residence’’ principle, so thi ¥ 
Set to equate the tax rates on domestic capital income and foreign interest income. The tax reform eliminates T, but rt 
















Impact Long-run Impact Long 
. Welfare aes ae Woe Se effect, 
Mode] economy change tbly c tbly c change tbly c tbly 
£)s Benchmark case 2.894  -—9.886 -3.858 3.817 -7.099 ..-0.824 9.969 .+4.031- —4.944 
Alternative tax policies 
World tax reform 
(7, = 15.9, 
TÉ = 25.7) 2.441 -1.551 -7.361 0.809 7.114 0.830 . 1.645 —7.824 -0.871 
Adjusting 7* 
(Tt = 18.4) 2.921 -—10.224 -3.706 3.953 7.093 -1.695 10.385 -5.458 -5.206 
Tax on bonds 
(T, = 1.05)" —0.676 ~8.126 —7.108 6.150 3.046 1.680 8.016 -—2.686 —7.865. 
Sensitivity analysis 
Costless f 
investment 3.760 —40.412 -3.977 2.472 8.517 —1.822 40.570 —6.036 —3.045 
Inelastic Jabor 
supply 3.271 -8.237 -2.153 2.427 8.143 —0.370 7.920 -5.602 -2.580 
Zero trade ; . * $ 
deficit” 2.743 —9.959 —4.057 3.756 6.920 0.702 10.404 —4.020 ~5.299 MẸ 
Note 


income taxes with consumption taxes, hold 
constant other key parameters of domestit. 
foreign tax policy. Thus, the next critical 
is to study how the outcome of the, 
sis varies when these considerations 
introduced. 
The top panel of Table 3 reports resul 
three alternative tax-policy scenarios, all 
the case of the U.S. capital income tax rela 
The benchmark results of Table 2 are alsof 
vided to facilitate comparisons. The firsti 
native tax-policy setting considers te 
that, since the U.S. tax reform reduces 
pean welfare, the tax reform is likely. l 
“unsustainable,” in the sense that it * a En 


i 
trigger a response by Europe’s tax eef 


and would likely lead to a game of 907 
competition. Examining this issue 12 





























geno. 1 MEN 


e scope of this paper, but we nev- 
` show that it is possible to design a 
“worldwide tax reform that constitutes 


Longs s P provement over current tax sys- 
This is done in the second row of Table 

bly +h assumes that both the United States 
Wee "Europe replace capital income taxes with 


z consumption taxes. Europe makes a 
„cent welfare gain, instead of a loss, and 
sited States still gains 2.4 percent. The 
‘and long-run effects of this reform on 
16 balance-GDP ratio are much weaker 
the benchmark case because both 


5.206 neal “as are undergoing the transition to a 
Steady-state capital stock. Hence, both 
7.865 (SIME: ‘ecymiries have incentives to borrow interna- 


l; ý and world financial markets do less to 
gatigate the costs of the transition. This also 
lies that the interest rate must rise more in 
ou es tto clear world markets. Note, however, 

4 this “coordinated tax reform’’ does not 
k ‘present a solution to either cooperative or 


‘weaeooperative tax competition—except for a 


53:23 gaaperative outcome based on an additive 
nges inj , weld welfare function parameterized arbi- 
-is the pé wazily-with the appropriate welfare weights. 

ints. She second major change in the tax-policy 
ciple, so tha #tting that we examine involves relaxing the 
ae mbun zuuimption that Europe has access to lump- 
g the int ‘taxation as a means to offset the impact 


US. tax reform on European tax reve- 
Which is not trivial, as shown in Figure 


v&c sein which Europe adjusts its consump- 


taxes, NOSE j maintain the present value of its tax 
f domestica: dentical to the unchanged present 
2xt critical ay government expenses. Clearly, Eu- 

of the ibe worse off because it now absorbs 


impact of the U.S. tax reform using 
ary taxes—-Europe’s welfare loss 


vorts res f nearly 1.7 percent, as 7* rises from 
enarios 18.4 percent. Thus, the benchmark case 
yme tax able 2 is an ideal scenario that gives 


the privilege of responding to the bud- 
. The fi eta wets of U.S. tax.reforms using non- 


siders they taxes. We showed that even in 
1 reduces 

is lik 

e that: 

dst a and Tesar (1995) we study the equilibria 
is f glo : Stekavion games under cooperative and nonco- 
ne of ior that are triggered by the global exter- 
issue forms. 
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that case the global externalities are very large, 
and that relaxing this assumption only mag- 
nifies them. 

The last important consideration regarding 
the tax-policy stance is to relax the assumption 


. that foreign interest income is not taxed. This 


is critical for two reasons. First, the assump- 
tion is very unrealistic. The majority of indus- 
trial countries tax all individual capital income 
according to the residence principle [see 
Frenkel and Razin (1992) and Frenkel et al. 
(1992)], so income from foreign securities is 
taxed at similar rates as domestic capital in- 
come. Second, opening the economy to trade 
in financial assets implies that tax policy has 
a new margin of distortion—the wedge be- 
tween the intertemporal marginal rate of sub- 
stitution in consumption and the world interest 
rate. 

The fourth row of Table 3 considers a case 
in which the United States applies the ‘‘full’’ 
residence principle to domestic capital and for- 
eign interest incomes in the pre-tax-reform 
equilibrium (i.e., 7, applies to both foreign in- 
terest and domestic capital incomes). Because 
we assumed discounted bonds, the easiest 
manner to introduce a foreign interest tax is 
via a tax on bond purchases 7,, so that the 
amount of resources invested into foreign 
bonds in equation (2) becomes (1 + y)R,1 + 
T,)B,,. The value of 7, consistent with the 
pre-reform value 7, = 41.5 percent is T, = 
1.05 percent. Note that, for reasons argued in 
Section I, subsection B, residence-based tax- 
ation and global trade in bonds require that T, 
be set at the same rate in both countries. In 
addition, although the tax on bonds distorts 
both the short- and long-run real interest rates, 
the reform we examine maintains T, un- 
changed. This ensures that there is a transitory 
increase in the world interest rate, while the 
long-run real interest rate is the same be- 
fore and after the reform (as in all other 
experiments). 

The results of the experiment with tax on 
bonds show that the distortion on the incen- 
tives for international borrowing alter the 
world distribution of the gains of the tax re- 
form. The welfare effects now amount to a 
loss of 0.7 percent for the United States and 
a gain of 1.7 percent for Europe. Inspection 
of the transitional dynamics shows that the 
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transitional increase in the rate of interest is 
larger when the tax on bonds is present (2.2 
basis points versus 1.6 basis points in the 
benchmark case), and the increased cost of 
borrowing results in a smaller trade deficit 
and a larger decline in consumption for the 
United States along the transition path. The 
smaller trade deficit on impact reflects the fact 
that the U.S. economy is borrowing less, but 
the long-run trade surplus grows to 6.2 per- 
cent, compared to 3.8 percent without the tax 
on bonds. This occurs because the long-run 
cost of servicing the debt is higher with the 
tax on bonds—the long-run interest rate is 
now given by r © p + T, — yo. In short, the 
tax on bonds- weakens the smoothing effect 
and strengthens the income-redistribution ef- 
fect, and since the former benefits the tax- 
reforming country and the latter the foreign 
country, there is a redistribution of welfare 
gains favoring Europe. Moreover, there is an 
additional fiscal effect because the larger 
transitional decline in consumption implies 
that the consumption tax must be increased 
more in order to satisfy the intertemporal 
budget constraint of the government—r, 
reaches 19.3 percent versus 14.8 percent 
without the tax on bonds. 

If one reconsiders the labor tax reform in 
the presence of a tax on bonds, there is still a 
sizable redistribution of the welfare gains— 
U.S. welfare increases 2.8 percent instead of 
3.1 percent, and European welfare increases 
0.3 percent instead of falling 0.3 percent. In 
this case, however, the international transmis- 
sion channels are weaker, and in fact contrib- 
ute to make the U.S. tax reform sustainable by 
producing a welfare gain for European house- 
holds without causing a welfare loss in the 
United States. 

In addition to the overall tax-policy envi- 
ronment, the international implications of tax 
reforms depend on the structure of preferences 
and technology, in particular on the curvature 
of the cost-of-adjustment function, 7, the lei- 
sure exponent, a, and the initial stock of for- 
eign assets, bo. The second panel of Table 3 
conducts sensitivity analysis for these three 
key parameters in the case of the U.S. capital 
income tax reform. 

First we examine the implications of sharply 
lowering 7, to a value nearly 22 times smaller 


































than in the benchmark. Predicta} 
smoothing and income-redistributig 
are much stronger, and the welfare 
of the home (foreign) country is larg 
adjustment costs are trivial, k and p 
perfect substitutes, and hence as the tax 
increases the post-tax domestic marginal 
uct of capital, households borrow as 
necessary to enlarge k immediately 
reset this marginal product at the ley, 
world interest rate. The latter rises mosi: 
before during the transition, because the e7aE 
country is relatively large in the world masae 
but still the home country’s trade defici gi 
share of GDP widens by more than 49 
centage points on impact, more than four 
as in the benchmark. Note, however, 
justment costs are needed only to 
enough ‘‘sand in the: wheels?’ of capi 
cumulation on the margins represented b 
ditions (9)—(10), and that this is accomp 
with adjustment costs that are trivial r 
to the size of the economy. In the bench 
case, U.S. adjustment costs converge rap 
and monotonically, to zero from a maxim @ 
of only 0.8 percent of GDP on the date thas 
reform is introduced. We conclude, thers 
that while the positive effects of the tax 
are highly dependent on the value of 
U.S. (European) welfare gains (losses) fi $y 
between 2.9 and 3.8 (—0.8 and — 1.8) forti g 
large variations in the curvature of adj 
costs. 
The case of a = 0, which effectively 
labor supply inelastic at the level of tht 
endowment, is examined next (second 
the bottom panel of Table 3). This ex, 
demonstrates the importance of the in 
between consumption and leisure in thes 
come of tax reforms. Setting a = 0 wi 
both smoothing and income-redistmt 
effects, as reflected in the smaller 1x3 Gages 
and long-run changes in the Us af 
balance-GDP ratio relative to the benC®=. 
case. Despite these weaker effects, 


| 


. welfare gain is larger, and the Buropeal De 


smaller, than in the benchmark cast 
somewhat counterintuitive result follows? 
two effects: ae 
(1) Lowering a in the United States 7 

the welfare gains of the tax reform fo 
reasons as in a closed economy (se 






























Ni 


e system (12)-(15) implies that a 
allows a given capital income tax cut 
-ate more output and consumption in 
ower utility cost in terms of 


z anomies. In the open economy, c/y 
‘alts both because the rise in k/y increases the 

v eeceemant-output ratio and because the tax 
3 to an increase in the net exports-GDP 
ervice the external debt. Given c/y, 
) implies that N is decreasing in a, so the 
‘labor supply increases when a falls, 
‘given ky, this results in larger output. 
creases because with the lower a the 
asein consumption is valued more than 


ately Solas 
1e level on 
ses mores 
ause the 

world rim 


provide a wider tax base, and hence im- 
the increase in 7, needed to maintain 


wering a* along with a weakens the. 
spillovers of the tax reform because 
an be no long-run outflow of physical 
fiway from Europe. Europe’s long-run 
ik*/y* is pinned down by the foreign- 
$abtFy version of (12), so the ratio does not 
$ eBzage because T is unchanged, and with a 
f Cabb:Douglas technology this ratio satisfies 
k*/N*)"-®, Given that N* is 
artc and k*/y* is constant, it follows that 
yénd:y*:are invariant to a U.S. tax reform. 
a short-run capital outflow from 
iggered by U.S. borrowing and tem- 

er world interest rates. Moreover, 
ith a’= 0, output in both countries is 

ed on the date of the reform, the 
the adjustment required to generate 
European trade surplus on impact is 


s (losses)iy 
q —1.8) fog 
re of adju 


ption. 
fare impact of the weaker interna- 
ssion can be isolated by measur- 
ess of the open-economy welfare 
the closed-economy welfare gains 
ñ values of a. With a = 2.675, the 
nearly */, of a percentage point, 
= 0 the difference narrows to 
fage points. 
y; the dynamics of convergence 
€d-growth path for European 
in the experiment with a = 0 


fa larger transitional decline in Eu- 
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are not monotonic, in sharp contrast to the 
benchmark case illustrated in Figure 2. Both 
European output and capital decline in the ini- 
tial stages of the transition and then increase 
until they converge to their pre-tax-reform lev- - 
els. Continuity implies that these nonmono- 
tonic dynamics remain a feature of the model 
for low values of a. Thus, it is in principle 
possible to rationalize the observed nonmono- 
tonicity of output and investment dynamics 
across countries (see Chari et al., 1996) as an ` 
outcome of the global externalities of tax- 
policy changes. 

The last row of Table 3 examines the case 
in which the U.S. pre-tax-reform foreign asset 
position is zero, instead of the amount consis- 
tent with a 1-percent, long-run trade deficit as 
a fraction of GDP. This implies that the United 
States undertakes the tax reform starting from 
balanced trade, and is equivalent to reducing 
U.S. financial wealth from bọ = 0.62 (which 
is consistent with tb/y = —0.01) to bọ = 0. 
Since the present value of the household’s ex- 
penditures must equal the present value of in- 
come plus financial wealth, the reduction in bo 
has an adverse effect on the U.S. welfare gain. 
The effect is quantitatively small because the 
initial bond positions and trade deficits in the 
benchmark case were small. However, since 
trade and foreign asset positions vary widely 
across countries and over time, and since the 
relevant value of b is the one prevailing exactly 


‘on the date of the tax reform (not an historical 


average), these results suggest that initial for- 

eign asset positions are a key factor determin- 

ing the outcome of tax reforms. For example, 

if the model is simulated assuming that bọ = 

3 instead of 0.62, which implies an initial U.S. 

trade deficit of 5 percent of GDP, the U.S. (Eu- 

ropean) net welfare gain (loss) increases (falls) . 
to 3.5 (—1.3) percent. 

Finally, it is worth noting that despite the 
sizable welfare gains produced by the tax re- 
forms we examined, there still remains a very 
large social cost associated with the need to 
raise revenue through distortionary taxation. 
In a hypothetical scenario in which the current 
structure of distortionary taxes could be re- 
placed with lump-sum taxation worldwide, the 
United States and Europe would make net wel- 
fare gains of nearly 13 and 20 percent, 
respectively. 
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IN. Conclusions 


This paper examines the positive and nor- 
mative effects of tax reforms in the context of a 
two-country dynamic macroeconomic model 
with fully integrated capital and goods markets. 
International capital markets alter the response 


- of the economy to tax reforms through two ef- 


fects: (a) a smoothing effect, reflected in in- 
creased external borrowing to finance the 
expansion of the capital stock while smoothing 
consumption and leisure, and (b) an interna- 
tional income-redistribution effect, reflected in 
the long-run trade surplus needed to service the 
external debt accumulated for smoothing pur- 
poses. The model is calibrated to reflect the cur- 
rent stance of tax policy in Europe and the 
United States, assuming standard specifications 
for preferences and technology, and simulated to 
quantify the international implications of re- 
forms replacing income taxes with higher con- 
sumption taxes. The model is solved numerically 
using a linear-approximation method augmented 
with shooting routines to determine solutions of 
competitive equilibria in which the world redis- 
tribution of financial wealth triggered by tax re- 
forms is properly quantified. 

The smoothing and income-redistribution ef- 
fects imply that measures of the normative and 
positive implications of tax reforms are mark- 
edly different in open economies than in closed 
economies. The net welfare gains of eliminat- 
ing U.S. income taxes can be up to 34 percent 
larger in the open-economy model and the cost 
of the macroeconomic transitional dynamics 
can be cut by more than half. Also, the excess 
welfare gain in the United States is obtained at 
the expense of a fall in European welfare. The 
enhanced efficiency of the U.S. economy, re- 
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ry 


sulting from the reduction in tax distortions, - 


leads European households to reallocate their 
savings from their own capital stock to inter- 
national bonds and to sacrifice consumption 
and leisure during the transition. They recover 
only a fraction of the welfare cost of these ad- 
justments via the income-redistribution effect. 
Further analysis reveals that knowledge of 
the complete structure of direct and indirect 
taxes across countries is critical for assessing 
the outcome of tax reforms. In particular, taxes 
on interest income from external assets add a 
new margin of distortion by driving a wedge 


. fer the benefits of the reform to a fore} 

































between the world real interest ra 
intertemporal marginal rate of substitu 
consumption. Through this wedge, 

foreign interest income weakens the 
ing effects and strengthens the į 
redistribution effect. As a result, a ¢& 
undertaking a unilateral tax reform ças, 


try with unchanged taxes. ; 

Our quantitative investigation demo 
that unilateral tax reforms induce sig 
externalities on the rest of the world; % 
would lead fiscal authorities to engage jy 
tentially damaging international tax 


do identify simple worldwide tax reforms ye 
ing Pareto-efficient outcomes that leave hag: 
holds worldwide better off than under pewa ior 


ods developed here provide the basic 
ents for computing solutions of intem 
tax-competition games, which we are p 
in work in progress (see Mendoza and 
1995). Further research on how our quanti 
results are altered by distributional issues & 
generations and income groups, the introdws 
of stochastic shocks or policy uncertainty, 
endogenous government expenditure choi 
of which have. been found important in thé 
cal work, are worth pursuing. k 

The international effects of tax refom E$ 
particularly sensitive to three elements d@% 
model’s structure. First, capital-adjustiae 
costs affect significantly the quantitati 
tures of transitional dynamics of ma 
nomic aggregates, although they 
relevant for welfaré implications. In ¢ 
the elasticity of labor supply is key fot. 
positive and normative implications.© 
model. If labor is inelastic, a domestic. 
form cannot lead to a permanent ou 
physical capital from abroad, and 
smoothing and income-redistribut! 
sharply weakened. The third key paral 
the net foreign asset position of the! 
reforming nation on the date the refom} 
plemented. -A large net creditor.10. 
markets has a stock of financial wealth t 
be depleted to smooth consumption 
sure, without causing long-run incom”. 
tribution on account of permanen 
payments to service foreign debt. 
















































: ra ae some of our findings have interesting 
subs for other aspects of open-economy 


e, > -< that have been the focus of recent 
: the ‘sm r ziglysis reform produces at the outset 
the ‘ine’ : istent trade and fiscal deficits for 
ult, aiGaaan oming nation, which are sustainable in 
orm i ‘of being consistent with long-run sol- 
1 foreign ame Tance, large fiscal and external imbal- 


untries embarked in far-reaching 
reforms can be equilibrium outcomes, 
‘suming to reduce them can be highly unde- 
‘Second, nonmonotonic convergence of 

nomic aggregates to balanced-growth 
‘hich violate the predictions of the con- 
| closed-economy, balanced-growth 
k, can emerge as a result of the inter- 


rat leay J externalities induced by changes in eco- 
n unde distortions of particular nations. 
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of a Monetary Unih 
By PABLO ANDRES NEUMEYER * 


=~ + dna general equilibrium-model with incomplete asset markets, nominal'securitie 
and mean-variance preferences, a monetary union is desirable when the gai 
from eliminating excess volatility of nominal variables exceeds the cost of re 
ducing the number of currencies with which to hedge risks. (JEL F33, D5 


E44, G10) 


At an informal level it is often argued, es- 
pecially among businessmen and politicians, 
that reducing uncertainty is an important ben- 
efit of a monetary union. At the heart of this 
argument lies the implicit assumption that 
some risks are uninsurable. Most of the liter- 
ature on the welfare effects of exchange rate 
regimes and currency unions, however, does 
not explicitly address the connection between 
the exchange rate regime and the incomplete- 
ness of financial markets.’ This article applies 
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! See Robert A. Mundell (1973) and Elhanan Helpman 
and Assaf Razin (1982) for early analyses of the relation 
between the exchange rate regime and the efficient sharing 
of international risks. 


sive’’ exchange rate variability harmi: 
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general equilibrium theory to investi 
a monetary union affects the efficien 
allocation of risks in an economy with 
plete asset markets. 
The paper compares a monetary unio 
exchange rate regimes that fall ‘short of% ig} 
manently fixing the exchange rate Cnc 
floating exchange rates, fixed exchang 
subject to realignments, and exchangee 
bands). It is shown that the adoption of 
etary union involves a trade-off betwe k. 
benefits of reducing ‘‘excessive’’ exch 
rate risk and the costs of reducing the nusier 
of assets in the economy. 3 
In spite of the general belief that “eE ; 


economy, proving this in a formal setting 
been difficult. The difficulty arises bei $ 
fluctuations in exchange rates that reflet 4 
nomic shocks (income and preference $ 
in this paper) actually help to alloca 
sources efficiently. The paper shows 
exchange rate risks that are not caused by Saas 
nomic shocks reduce the efficiency of insting 

markets. A potentially important source 08 
type of excess exchange rate risk 4 
when political factors influence monetaly 
icy. Political interference in monetaly. ae 
implies that given the realization of ate 
nomic shock, there still is uncertain 
the future actions of monetary authorities ie 
such actions will be influenced by fv 
litical events. For example, the timing” 0 
etary stabilizations in inflationary 00? 
the value at which currencies enter 4 
exchange rate regime, and exchang® 








































ic are instances of monetary policy 
that depend on the realization of po- 
2 Jn such cases the expected vari- 
~of nominal variables is too high due to 
os dence on future political shocks. 
seg fluctuations in price levels are 
ae nominal financial contracts, re- 
; ‘their ability to help agents hedge 
gar économic shocks. Currency unions and 
tly fixed exchange rate regimes can 
wed as monetary rules that attempt to 
welfare by insulating money from do- 
; é politics. l 
; adoption of a fixed exchange rate re- 
‘game a currency union also has costs. When 
ations in the value of money reflect eco- 
“genie shocks, some exchange rate variability 
5d"’ because, by making the real payoffs 
saminal assets denominated in different 
reneles distinct, it increases the insurance 
ssitiniities available through trade in nom- 
assets, The loss of monetary independence 
Me ecisited by fixed exchange rate regimes, or 
| caweney unions, is socially costly because it 
Takes the real payoff of assets denominated 


ing the number of financial instruments 
hich economic agents can share risks. 

-‘]he'imain result of the paper is that switch- 
a from à Monetary regime with national cen- 
f at banks ito a currency union increases 


k í 
3 
i 


tical hesitation, the Menem government fi- 
fd to tackle the problem at its root and launched 
ibility Plan.” See also Alberto Alesina and 
1991) for. the timing of stabilizations. 
aman and Anna Schwartz (1963) discuss the 

tying the resumption of gold and the debate 
after the American Civil War. Jeffry A. 
a) studies the politics of currency crises in 
ü Monetary System (ERM) and Barry 
4992) analyzes the politics of the gold 


ple, current plans to form a European cur- 
ide the creation of a supranational central 
Prevent monetary policy from being sub- 


richt Treaty; see Charles Bean, 1992). 
Ston of the political and institutional as- 
union, see Michael Mussa (1997). 


7 
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y because_they contaminate the- 


of domestic political authorities (Ar- ` 
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welfare when the gain from eliminating excess 
monetary volatility exceeds the cost of reduc- 
ing the number of financial instruments in the 
economy. This case for a monetary union has 
not been previously modeled in the interna- 
tional finance literature. 


.— The modern theory of general equilibrium =~ 


with incomplete financial markets provides a 
natural and fruitful framework in which to 
study the effect of monetary and exchange rate 
regimes on the allocation of risks (see, for ex- 
ample, Michael Magill and Martine Quinzii, 
1996). Helpman (1981), John H. Kareken and . 
Neil Wallace (1981), and Robert E. Lucas, Jr. 
(1982) were the first to use general equilibrium 
models to evaluate the welfare effects of 
exchange rate regimes. These seminal contri- 
butions prove welfare equivalence theorems for 
alternative exchange rate regimes in environ- 
ments where asset markets are complete and 
money is neutral. Following these contribu- 
tions, the connection between the exchange rate 
regime and the efficiency with which an incom- 
plete set of nominal financial markets can al- 
locate risk was studied by Helpman and Razin 
(1982). In their model, a floating exchange rate 
regime dominates a fixed one because the latter 
effectively reduces the number of assets in the 
economy. Helpman and Razin felt that there 
ought to be conditions under which a fixed 
exchange rate regime outperforms a floating 
one, but as they could not find any, they left the 
question open for future research. This paper 
takes up the task of comparing alternative 
exchange rate regimes where Helpman and 
Razin left it, with the aid of two ideas borrowed 
from the modern theory of general equilibrium 
with incomplete financial markets. 
The first insight of this literature for the pur- 
pose of this paper is the geometric approach 
to the role of money in economies with incom- 
plete markets (Magill and Quinzii, 1992). 


5 The literature following these contributions can be 


-> classified accofding tthe frictions generating the mone- 


tary nonneutralities that make the exchange rate regime 
relevant. In this paper nonneutrality arises because asset 
markets are incomplete and some financial contracts are 
written in nominal terms. Other frictions are price rigidi- 
ties (Mundell, 1961) and the use of seignorage as a tax 
(Matthew B. Canzoneri and Carol Ann Rogers, 1990). 
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Money matters because it changes the reallo- 
cations of income across states of nature that 
are attainable through trade in assets (it tilts 
the vector subspace of income transfers 
spanned by the matrix of asset payoffs) .° This 
idea serves to clarify the result in Helpman and 
Razin and is a central element of this study. 
The second idea borrowed from the theory of 
incomplete markets is that future price levels 
can depend, not only on shocks to preferences 
or endowments (economic shocks), but also 
on other sources of uncertainty called ‘‘sun- 
spots’’ in the general equilibrium literature 
[see Cass (1989) and Paolo Siconolfi 
(1991)]. These sunspots are the political 
-shocks that influence monetary policy in this 
paper and create the opening (that is missing 
in Helpman and Razin) for making a new case 
for fixed exchange rate regimes. 

The remainder of the paper is organized as 
follows: Section I describes the model; Section 
II characterizes the equilibrium geometrically; 
Section III establishes the welfare costs of 
monetary politics; Section IV evaluates the 
welfare effects of a monetary union; and Sec- 
tion V concludes. 


I. Description of the Model 


Consider a two-date general equilibrium 
model of an exchange economy with one per- 
ishable commodity. The world’s population is 
represented by the set H = {1, ..., H}’ and 


é The idea that changes in the price level have real ef- 
fects because they alter the space of income transfers 
across states of nature that are attainable through trade in 
assets was originally developed in David Cass (1985), 
Yves Balasko and Cass (1989), and John Geanakoplos 


and Andreu Mas-Colell (1989). See Heracles . 


Polemarchakis (1988) for economies with multiple cur- 
rencies. At the time when Helpman and Razin wrote their 
paper, the geometry of monetary policy in economies with 
incomplete markets was not well understood. This idea 
helps to understand not only Helpman and Razin but, also, 
other important articles in international macroeconomics 
such as Torsten Persson and Lars E. O. Svensson (1989) 


= and Svensson (1989). 


7 Notation: Bold upper-case letters denote sets, upper- 
case italic letters denote the number of elements in a set, 
and lower-case italic letters denote an element of the set. 
For example, H is the set of individuals, H is the number 
of individuals, and h is one particular individual. 


_. date J, one of S possible states js 

































individuals are indexed by h. The px 
the set H into countries defines 7 < 
individuals, H;. These countries 
by i withi E€ I = {1,..., Z}. 
Economic activity extends over 
Agents know the present, called s 
face an uncertain future. ‘At the seco 


Each of the states at date 1 is defineg’ 
realization of two types of shocks: ¢ 
and political. The economic uncertaj 
sociated with the realization of each 
ual’s income. The states of nature that 
on the realization of this type of un 
will be indexed by o and take values jn ia: 
£= {1,.., E}; E > 1. The political pg 
that influence monetary policy will 
sented by the random variable, 6, taki 
in the set © = {1, ..., ©}. It is posses 
write the set @ as O = ©, X O, X-#4 
and its elements as 6 = (6), 62, ..., Oi); EA 
preting each element of the vector 6 as tend 
litical uncertainty that is generated in oak 
i. Summarizing, a state at date 1 is s = (q ġ E4 
and the state space of the economy is {0X9 
S, where S = & X ©. The probability 
s occurring at date 1 is denoted by a(s 
In each state the quantity of money ti: 
injected in each country is ;M(s). Witsg 
loss of generality, it is assumed that fë $ 
countries ;M (0) = 1. Monetary policy inst 
try i is represented by a function Mis 
5. An independent central bank with! 
cretionary power will follow a rule wi 
is independent of the political shocks.” 


8 The money supply function ; M is consiste! 
empirical literature on monetary policy sh 
Lawrence J. Christiano and Martin Eichenba 
for a survey] and with many computational ¢ 
where researchers work with monetary policy n ®2 3 
form M = f(e) + £, where f(c) is a feedback 1 
depends on the state of the economy and € 15 
shock, that can be interpreted as a function of the 
shocks. A 
? The political shocks also can be interpreted 
_fulfilling beliefs about the. future actions of a 

monetary authority [Guillermo A. Calvo ( 
V. V. Chari et al. (1996)]. Under a discretion®? f 
policy makers can be pushed into pursuing 28 wee oe 

policy when the private sector, for whatever os a 
pects inflation, and monetary authorities find it! 
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"zZ v- setble to interpret the political shocks 
PP oy about the preferences of a be- 
t policy maker that maximizes a 
average of the utility attained by the 






































er tw under its jurisdiction at a competitive 
i state ORs epegm. This uncertainty can arise be- 
UN i prium. A 
second tadi m aiyate agents do not know the policy 
S iS real <<nreferences or because they do not 


efin % erty will be in power in the future. 
e individual has an endowment vector 


ertainty, Q, where 

each ‘ind 

re that. a {u' E Rit! [Yo € Z:w"(a, 0) 
of un beet | 

alues į w*(a, 6’) for every 6,0’ € O}. 
olitical 


definition of the set Q formalizes the no- 

that political shocks are distinct from eco- 

ones. Notice, however, that the two 

‘af shocks do not need to be statistically 
nt. š 


+ 


tor 6 asihe ‘fences are not stochastic and are rep- 
ated in gated by the linear-quadratic utility 
Lis s S (o) section - : 

jomy i í 

babilit Nex xt (0) 

d by z 

f money + È x(s)[ox"(s) — 3x*(s)*], 
M( s) N 7 sES 

ned that efit” « 

y policy A ‘defined over the consumption set 
ction ;4 Bees and where x"(s) is consumption in state 
yank with Wt &fder to guarantee monotonicity, it is as- 
a rule fed that e > D, cg w"(s) for all s € S.'° 
| shocks. ‘ cation of preferences specializes 


to the capital asset pricing model 
M)« It has the advantage of yielding a 
ed-fotm solution and of conferring very 


is consis trical properties to the equilibrium. 
Pichenbe eclirities that are traded in world fi- 
as Seni kets are an indexed bond that 
ary policy ‘Unit of the consumption good in every 
s a feedba 4 nominal bond denominated in 


ny and 


anction of.the 

be interpr j i i 
ctions of § of R 

A. Calvo output, or in terms of the need to 
a discre ie taxes) not to accommodate these 


pct ae across individuals does not alter 
ut adds to its notational complexity. 


each of the currencies.'' The nominal bonds 
pay one unit of money in every state; their 
real payoffs are the inverse of the price level 
in currency i. The payoffs of all the securi- 
ties can be collected in an S X J matrix of 
real asset payoffs, 











S | es coll 
ip(1) iP(1) 
A(p)=| i i > jo 
Ji 1 sà 1 
ip(s) P(S) 
where p = [ip, ..., ıp]. The column vectors 


of A, aj, represent the payoff of asset j. The 
row vectors of A, a(s), represent the payoff 
of all the securities in state s. 

As there are no trading frictions associated 
with trade in either assets or commodities, the 
law of one price must hold, i.e., 


(2) ej(Oiqk = jG for every i, j, and k, 


for every i, j, and 
sE{0}US, 


e;(s)p(s) =;p(s) 


where e;; is the price of currency j in terms of 
currency i, ;p is the price level in currency j, 
and ;q is the price of security k in currency j. 
These no-arbitrage conditions imply that the 
date 0 budget constraint can be written as: 


x"(0) + qy" = w*(0), 


where q € R” is the vector of real asset prices 
and y” € R” denotes the portfolio of individual 
h, with y} representing holdings of security j. 


-e eu 


1" This asset structure implicitly includes futures cur- 
rency contracts. The absence of arbitrage opportunities in 
financial markets implies that a futures contract to 
exchange currency i for currency j is equivalent to being 
long in the nominal bond denominated in currency j and 
short in the bond in currency i. . 
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The date 0 budget constraint states that con- 
sumption plus the value of the securities pur- 
chased at date 0 must equal each individuals 
endowment at date 0. 

At the second date, agents discover their en- 
dowment w"(s), and deliveries on the finan- 
cial promises made at date 0 are carried out. 


+~ The budget constraints are. 


x*(s) = w"(s) + a(s)y’ 


in each of the S states at date 1. - 
Summarizing, the consumer’s budget set is 


(3) B(p,q) = T ERS xR”: 


age ( Ta jyt, 

A(p)]° 

The money markets clear if, for s = 0, 1, 
anaes 


(4) iM(s) = ;p(s)w(s), 


where ,;w(s) is the aggregate endowment of 
country i in state s. These money market- 
clearing conditions are quantity equations with 
unitary velocity.’ 

Financial markets clear if 


È y'=0. 


hEH 


The economy is in equilibrium when indi- 
viduals choose their consumption and portfo- 
lios in order to maximize their utility in the 

` budget set B’(p, q), the money market- 
clearing conditions (4) hold, and asset markets 


The monetary institutions underlying this model are 
the extension to an international setting of those presented 
in Magill and Quinzii (1992). Money does not appear in 
the budget constraints because it is assumed that at the 
beginning of each period consumers sell their endowments 
to the central bank, then they trade in asset and currency 
markets, and at the end of each period they sell their 
money holdings back to the central banks in exchange for 
consumption goods. See the working paper version 
(1995) of this article for details. 


„surance opportunities available through 

























clear. Walras Law implies that the 
ket clears as well. 


Il. The Geometry of Equilibr 
An immediate consequence of the ‘ 


market-clearing conditions is that 
rium matrix of asset payoffs is 


| W0) 
iM(1) 
A(p*) = f 
p 9) 
ıM(S) 


where p* denotes the equilibrium price 
The convention that a starred variab| Ta 

cates its equilibrium value will be adonat Ga 
the remainder of the paper. r, 


the problem of understanding the reasbi w 
hind the existence of securities written inis 


money. If securities are standardized aie: 
countries some exchange rate variability (& $% 
makes price levels in different count 
tinct) is ‘‘good’’ because it increases esa 
in nominal assets. Observe that for any 
monetary policy the matrix A(p*) will bat 
full rank for almost any configuration of : 
dowments. On the other hand, the loss of 
etary independence entailed by fixed exa 
rates is costly: fixed exchange rates me" 
money supply endogenous and the prit 
in different countries colinear, inducing’! 
of rank in A(p)." 
The budget constraints (3) imply. 
excess demand of each individual at dalé 
to belong to the linear subspace spat: 


duced by a fixed exchange rate regime is the mas 
a floating exchange rate regime is better than §™: 
in Helpman and Razin (1982). 



























vat of A(p*). This hyperplane is 
rage marketed subspace (of R5) and its 
San ig equal to the number of nonre- 
tassets in the economy (the indepen- 
jumns of A), J. When asset markets 
“plete (i.e., J < S) the marketed sub- 
has a lower dimension than the con- 


mand that corresponds to an equilib- 
‘an economy with complete markets 
be feasible. Equilibrium excess de- 
zan economy with incomplete mar- 
be the closest point in the marketed 
“to the complete markets allocation. 
licy matters when changes in M 
marketed subspace. 

‘complete markets allocation is given 





Au + r*) is the price at date 0 of a 
iB tis init of the consumption good at date 


‘aré complete there is perfect risk 
OF full insurance. This can be readily 
rom the fact that under this allo- 
ources, each individual consumes 


mption about the functional form 
bhty function implies that the utility 
idl attain at an equilibrium can be 


) = ult) + d(x, wt) 


A 


ul(t") — d(t", xf” 


markets allocation can be easily com- 
the payoff matrix A by the identity 
on a in the Appendix. 


set at date 1, and, in general, the’ 
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where d(x, y) = = m(s)(x(s) — y(s))? mea- 
sures the distance between the points x and y 
in R5. The distance between the date 1 endow- 
ment and equilibrium consumption is an exact 


`- measure‘of the gains from trade in assets. The 


distance between the date 1 complete markets 
allocation and equilibrium consumption mea- 
sures the inefficiency introduced by the incom- 
pleteness of financial markets. Equilibrium 
portfolios pick the excess demand that mini- 
mizes the distance between the complete 
markets excess demand and the marketed 
subspace." 

Figure 1 illustrates the geometry of equilib- 
rium allocations in an economy with two assets 
and three states of nature at date 1. The graph ` 
is drawn in the space of date 1 consumption: a 
three dimensional space where each axis rép- 
resents consumption in one of the states. If there 
iS no trade in assets the agent can only consume 
at the endowment point, w”. Financial markets 
allow individuals to reallocate income across 
states of nature. However, the budget set, (3), 
constrains individual excess demands to lie in 
the marketed subspace. In this example, the 
marketed subspace is the plane in R° that con- 
tains the endowment point and is spanned by 
the payoff vectors of assets a, and a,. Figure 1 
shows that equilibrium consumption at date 1, 
x*", is the closest point in the marketed sub- 
space to the complete markets allocation, <7. 
The individual depicted in the figure attains this 
allocation with a long position in asset a, and 
a short position in asset a. 

Monetary policy has real effects because 
price levels determine the position of the pay- 


‘off vectors of assets a, and a, and, hence, that 


of the marketed subspace. For example, if the 
price level corresponding to the currency in 
which asset a, is denominated increases in 
state 3, then the payoff of asset a, in state 3 
(vertical axis) will be smaller, and the mar- 
keted subspace will tilt down. The equilibrium 
consuniption of agent h will change because it’ 
will cease to be feasible. With the new price- 


15 Technically, the equilibrium excess demand with in- 
complete markets is the -orthogonal projection of the 
complete markets excess demand onto the marketed 
subspace. 
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FIGURE 1. THE GEOMETRY OF EQUILIBRIUM 


level vector, equilibrium consumption will be 
the closest point to the complete markets al- 
location lying in the new marketed subspace. 
Due to the way in which the plane was tilted, 
it will be closer to the autarky point and it will 
be further away from the complete markets al- 
location. Individual h, a lender in currency 1, 
will be worse off. 

If country 2 pegs its currency to country 1, 
the payoff vectors corresponding to the two 
assets will become colinear, and individuals 
will only be able to consume on a point on the 
line that crosses their endowment in the direc- 
tion of asset 1. In general, this will lower the 
utility of A. 


il. The Welfare Cost of Monetary Politics 


The objective of this section is to formalize 
the idea that the excessive nominal (exchange 
rate and price level) volatility that arises from 
political shocks reduces the gains from trading 
assets and, hence, welfare. This creates the 
opening for making a new case for fixed 


exchange rates in countries where this volatil-- - 


ity is large. 
In order to analyze the welfare costs of con- 
taminating the real payoff of nominal assets 


- in an hypothetical economy where 


- the 7 countries. 


_ability can be exactly measured. Re 

























with political shocks, equilibrium. 
are compared with the ones that y 


policy is made independent of 6, b 
it by its expected value conditional: 
iM € R5 be a vector with elementg;: 
{2s (6/0) [1/,M(o, 8)]}~" and ies 
.., M] € R* represent monetary policy com 


The utility gains that each indivi 
derive from eliminating excess no 


equilibrium utilities can be written 

u"(w") + d(x#", wf). Let iz* and x hae 
equilibrium utility and consumption sca 
monetary policy M, respectively, It 
that the difference iv” — u" is 


a(t o 


assets: 


(6) iz” = u" as z! (A'A 
~ (A'AA)'Jz*2 


where z" = AA(r)[(w,/H) — wi] and 
trix A corresponds to monetary policy. Mf 
matrices A’AA and A’AA are the 
covariance matrices of asset returns W 
etary policies M and M, respectivel 
difference is the excess volatility introdus 
the political uncertainty, which i is a sem 

tive definite matrix. 


country monetary policy depends on 
shocks, individuals are diverse (z 
some 4), and asset markets are incom 
inequality in (6) will be strict for at least 
agent. 


PROPOSITION 1: Let there be mot. 
two agents in each country and fe 
than‘states of the world. Then it is 
true that any monetary policy that 
to political shocks supports an eq 
that is Pareto inferior to the one com: 























ition and its proof.) 

monetary policy that introduces po- 
any me" ty in the economy, there exists 
g policy, that is independent of 6, and is 


es : formal statement 
the Appendix for a 
(ser the 


rior. 7 
Fitton underlying Proposition 1 and 
-ion (6) is best seen in the simple case 


vV 

>, cov[(1/;p*), (14p*)] = 0 for all 
‘case, the expression for the welfare 
Jiminating the monetary politics re- 


| 2 ova 5 


iM o) 
SN ees 
var( —£ 
(a 
@è = w/H — w? and var[(w,/;M)| 


A= anlO) lw (o) M Co, 8) — wilo) 
TG) is 





sion (7) illustrates that any mone- 
ey for which M(o, 0) # M(o, 6’) 
ated by policy M. The term È, 
[(w,/,M)|o] is the expected value 
ance of national price levels con- 
n g; it measures the sensitivity of 
policy to political uncertainty. It is 
éè in (7) that the gains from elimi- 
cess nominal variability is an in- 
“function of this variable. If one 


ve ie 


8 ihi) = 0 and 2, n(s)var[(w;/4;iM)|s] > 0, 


ted in currency i will not be traded and 
de of equation (7) will be zero. If var(w;/ 
in Currency i will be a perfect substitute 
+ and the right-hand side of equation (7) 


ariance-covariance matrices are di- - 
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thinks that there is a country zero, where the 
price level is constant and equal to one, w;/ 
iM would be the exchange rate between cur- 
rency i and currency 0 and the term È, 
a(o)var[(w;/;M)|o] would be the excess 
exchange rate volatility." 


Equation (7) can also-be written as ~ mt tee 


ü’ — u" 





Do n(ayvar( 2 z) 


(ii) 
var i 
x5 moyvar( 


ala), 


The welfare loss that traders suffer because of 
the excess nominal variability in currency i is 
proportional to the square of their equilibrium 
trading position in that currency.'* 
Proposition 1 contributes to the general equi- 
librium literature on the welfare costs of mon- 
etary instability by showing that excessive 
monetary variability is socially costly because 
tampering with nominal financial contracts has 
negative effects on financial markets. This wel- 
fare cost of monetary instability forms the basis 
for Irving Fisher’s (1936) case in favor of mon- 
etary rules. Friedman’s (1960, 1962) case for 
rules rested, among other things, on his view 
that ‘‘(...) it is essential to prevent monetary 
policy from being a day-to-day plaything at the 
mercy of every whim of current political 


= Lore) 1+ 





17 If the value of the reference currency has a positive 
variance, then the relation between the variability of 
exchange rates and price levels is given by the formula 


var(e;) _ var(pi) _ vat(p,) 


Se et a — 


Elea) E? EPY 


cov (p; , Pr) 
E(p;)E(p.)’ 


where p, is the price level of the reference currency. 

' This expression provides a theoretical basis for 
Frieden’s (1994b) empirical observation that the special 
interest groups that exert the strongest political pressure 
to reduce exchange rate variability are those linked to the 
financial sector. 
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authorities.” (Friedman, 1962 p. 224). Thus, 
Proposition 1 can be interpreted in terms of the 
literature that studies whether a monetary au- 
thority should follow preannounced rules or 
whether it should be granted discretionary 
power to conduct policy. A fixed exchange rate 
regime can be viewed as a monetary rule that 
is socially desirable when it removes excessive 
exchange rate variability. This case for a mon- 
etary rule is different from Finn E. Kydland and 
Edward C. Prescott’s (1977) because their case 
for a rule is based on the gains from reducing 
the level of inflation, not its variability. 


IV. The Welfare Effects of a Monetary Union 


A monetary union entails the elimination of 
the U currencies of the countries that form the 
union and the creation of a common currency 
with its associated monetary policy. The dis- 
appearance of nominal assets in the merging 
currencies, and the creation of the new cur- 
rency, transform the financial structure of the 
economy. The new asset structure is repre- 
sented by the matrix 








A,(p) 
bless ae ty aed 
iP(1) y-uP(1) a«p(l) 
pee, ee, 
iP(S) 1-uP(S)  ,p(S) 


where the equilibrium price level in the union 
will be determined by the quantity equations 
uM(s) = .p(s)w,(s), and „M, „p, and w, are 
the union’s money supply, price level, and its 
aggregate endowment, respectively. 

A regime with permanently fixed exchange 
rates in which countries peg their currencies 
to that of a ‘‘center’’ country (like the United 
States in the Bretton Woods system) is equiv- 
alent to a monetary union, with „p, „M, and „w 
being the center country’s price level, money 
supply, and aggregate income; ‘respectively. 

The objective of this section is to evaluate 
the welfare effects of a monetary union. To 
that extent, the effects of reducing exchange 


- welfare costs of the excessive nominal} 






























rate variability will be isolated fro 
fects on the asset structure. 

The institutional design of the ne 
bank will determine the sensitivity ofti 
monetary policy to political factors, Ths 
of the new institutions, relative to the ol 
in insulating monetary policy from 
pressures can be measured by compas 


under the two monetary regimes. Thus, 
from switching to the union’s mone 
decision-making process will be (iz 
(i — ut), where the subscripts 0 and 
the economies before and after the 
spectively. If the design of the union's’ 
bank achieves the goal of divorcing mong 
politics, the difference #4 — uf will be 
the gains from switching to the currency 
will equal the costs of the excess nominal} 


of assets in the economy can be studi 
comparing the equilibrium utilities that ca 
attained in economies with the asset strucax 


represented by Ay and A,, i.e., i and Ñ, List: 


ces, instead of Ao and A,, is useful. 
separating the effect of changing the nual 
of assets from the effect of changing the ena 
nominal volatility. The loss of nominal 
in the U disappearing currencies and the 
ation of assets in the union’s currency fe 
the dimension of the marketed subspacé:® 
If the payoffs of the nominal bonds if 
new currency are a linear combination d 
payoffs of the nominal bonds in the: 
pearing currencies, then the market 
space generated by A, will be included 5 
one generated by Ay and, generally, W 
will be positive. This will be the cas. 
money supply of the union and its mem 
constant, and in the case of a move t 
fixed exchange rate regime. In the m 
eral case, when the new nominal bondsé 
in the previous marketed subspace, the! 
marketed subspace will have a lower &# 
sion but will also change its position 
— i, can be either positive or negat 
In terms of Figure 1, in the first E% 
change in the asset structure of the %4 








































MARCK "NO. 1 

ma = a: è the plane, spanned by the vectors a, 
; ane Se by a Jine in that plane. In the more 

new: ae 1:2 ase, the plane, spanned by a, and a, 

ft ae ae replaced by a line that is either closer 

Re haad away from the complete markets ex- 

eol ; “acnand of each agent. 

oma <<< discussion implies that, for each indi- 

yma ae the welfare effect of a monetary union 

nal in “= pe decomposed into the gains of changing 

‘hus, ari bis “y decision rules and the costs of chang- 

me ' fe number of assets in the economy, i.e., 

a uh = [T4 — uh) — (7t — uly] 

the unig 

mion go — (i — ii 

1g money. . i 

will be: POSITION 2: In a general equilibrium 

currency, with incomplete asset markets, nominal 


wsaetary union is desirable when the gain 
‘Eem eliminating the excess volatility of nom- 


ing th yariables caused by political shocks ex- 
the cost of effectively reducing the 

ities that cam] er of assets in the economy. 

asset st HEY 


i B The implications of this proposition are best 
i @ederateod by considering three special cases: 
k: aa Where a monetary union is Pareto inferior 


sing the ml ae $ = baving independent currencies because mon- 
nging thé tyga 237 ‘policy before the union is already insu- 
f nomina g Sd from political shocks; a second one, where 
zies and the ë union is Pareto superior because 


jasay policy before the union is determined 


d subsp cal shocks but after the union it is not; 
nal bon third one, that considers fixed exchange 
mbinatic es that fall short of a monetary union. 


first case, a monetary union is Pareto 
¥ tò a system of independent currencies. 


e 
ye inclu a world where there is a representative 
enerally Meach country, and agents are hetero- 
be the case across countries. Such a world can be 
oad its mel f as one, where within each country — 
a move ioga €xperience similar economic shocks or, 

eatly, as one where there are good risk- 
inal bo ces within countries but not across 
subspace onal borders.! In such a world, inde- 
vea lo tral banks that follow a constant 
5 ositio 


Markets and the welfare state are the 
modern economies have for sharing 


that 
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money supply rule will implement the complete 
markets allocation. Nominal bonds will be 
claims on each country’s representative agent’s 
income, and will suffice for effectively com- 
pleting financial markets (the complete markets 
excess demands will belong to the marketed 


. subspace). If any.subset of these countries de- 


cides to form a monetary union, the complete 
markets allocations will no longer be feasible 
due to the reduction in the number of assets, 
and the resulting equilibrium will be Pareto in- 
ferior to the multicurrency system (indepen- 
dently of the union’s monetary policy). This 
example suggests that, from the perspective of 
risk sharing, an optimal currency ‘‘area’’ is a 
set of individuals who are able to perfectly 
share risks among themselves, and use nominal 
assets to sell their collective risk.” 

In the second example, a monetary union is 
Pareto superior. Consider a two-country world 
where there are no indexed bonds, there is no 
economic uncertainty, agents want to transfer 
income intertemporally, and in each country 
the money supply is very sensitive to political 
shocks. Consumers will trade these nominal 
bonds to transfer income intertemporally, and 
this will make their consumption sensitive to 
political shocks. A move towards a monetary 
union with a single monetary authority that 
follows a constant nominal money supply rule 
will implement the complete markets alloca- 
tion that is Pareto superior to the previous 
equilibrium. In this example, the loss of one 
asset entails no welfare cost: in the absence of 
political shocks there is only one state at date 
1, and, hence, one asset with a constant real 
payoff is sufficient to complete the markets. 

Third, consider a monetary/exchange rate 
regime such as the gold standard, the exchange 
rate mechanism of the ERM, or the currency 
board implemented in Argentina in 1991. All 
these arrangements are rules that attempt to 





risks, this situation is consistent with a world where, for 
some reason; equity-portfolios exhibit-a home bias, and- 
where there are no supranational welfare states. 

2 In this respect, it is interesting to observe that the 
plans for a monetary union in Europe have also motivated 
proposals for supranational fiscal programs aimed at im- 
proving intra-European regional risk sharing (Bean, 1992; 
Eichengreen, 1993). 
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limit exchange rate variability, and fall short 
of an irreversible monetary union (or perma- 
nently fixed exchange rates). Under these re- 
gimes, the central bank follows a rule that fixes 
the exchange rate (or allows it to float within 
a tight band), and agents know that there are 
circumstances under which the rule can be 
abandoned (pessibly.on a temporary..basis).. 
Let the set of economic shocks for which the 
rule will be maintained with probability one 
be R C 5, and let R° be its complement. In 
the states belonging to R“ it is possible that the 
rule will be abandoned, and, as the decision to 
do so is a political one,” var[(w;/;M)|o] will 
be large. Expressions (6) and (7) imply that 
credible monetary rules, for which there is a 
large probability that the rule will not be bro- 
ken and, therefore, that var[(w;/;M)|o] will 
be small, are effective devices for reducing the 
welfare costs of monetary instability. On the 
other hand, the extreme circumstances under 
which policy makers may decide to depart 
from a preannounced rule (and var[(w;/ 
;M){o] may be large) might be precisely the 
ones where nominal assets are the most useful. 
For example, countries that are net debtors in 
their own currency might find it very valuable 
to repudiate part of their debt through inflation 
in times of great need (such as wars and major 
recessions ) at the cost of paying higher ex post 
real interest rates in good times. Summing up, 
the model implies that these hybrid regimes, ” 
with both rules and discretion, are a compro- 
mise between the benefits of removing all dis- 
cretionary powers from monetary authorities 


2! Examples of political decisions to abandon a mone- 
tary rule are ubiquitous in history. Under the classical gold 
standard, the currencies of the center countries where 
taken off gold during wars while in the periphery there 
were also other “‘critical’’ circumstances that triggered 
suspensions of convertibility (Eichengreen, 1992; 
Michael D. Bordo and Kydland, 1995). In Europe, some 
governments decided to abandon the ERM after German 
reunification, while others did not. The Tequila crisis in 
1994 (the decision by the Mexican government to aban- 
don its exchange rate regime) put pressure on. Argentina 
to abandon its currency board but the government decided 
to pay the political costs of sticking to its exchange rate 
commitment (Cavallo and Cottani, 1997). 

» For example, ;M;w = ,M,w in normal times and de- 
value when ;w is below a critical unannounced value. 































and the cost of losing the advantag E 
inal contracts as insurance devices, 


V. Conclusions 


Currency unions are designed wiy 
purpose of eliminating exchange rate 
ability and insulating monetary polici 
political pressures. At the same time 
change the tools available to share ; 
among nations. This paper develops a TORIET: 
eral equilibrium model in which thes o ; 
can be analyzed, and describes an ec 
environment where eliminating excetg 
monetary variability improves welf; 
shown that in an environment with p 
securities, incomplete markets, and 
with quasi-linear quadratic prefere 
monetary union is desirable when thë 
from eliminating the excess volatili 
nominal variables exceed the cost of ¢ 
ing the asset structure. we 

The analysis rests on the natural assie) 
tions that (monetary) policy depends of & 
realization of political events and that iss 
ance markets for political risks do noté 
Currency unions are viewed as rules ida 
eliminate (monetary) political risks by iit- $a 
ing the link between policy and polities: #3 $4 
democratic society, however, there ar pss 
reasons for tying policy to political deds 
Cutting this link may result in an allocats® 
resources that is socially undesirable ¢# 
though it allocates risks more efficiently.) 
interaction between evolving social pf 
ences, public policy, and an efficient alloca 
of risks merits further research. 


APPENDIX 


PROPOSITION A1: Assume that t ait 
more than two agents in each country | oe 
2 fori =1,..., I), asset markets aré it 
plete (J < S), and monetary policy 
isfies,;M(o, 0) + ;M(o, 0") for some% 
and i.. Then, for a generic set of € 

there exists a monetary policy M (ine 
political shocks) that supports an. 
rium that Pareto dominates the one 
by monetary policy M. 








i e R*® (except r), let 
Given ante f as R aes 
‘BE be the projection of x O} š 











onal matrix X =_(%1,...:»%), let X be 
s XJ ) . 


ige ratch Ey J dimensional matrix X = (x;, ve 

y policy z -= n(o) = Zo T(a, 0). A(x) is the 
ne tima matrix derived from the vector x. 

o shari eae 

velops ‘A generic set of endowments is an open and 


'h these isa of full Lebesgue measure in Q. 

- ee J, then there exists a generic set of 
 etewments, 2*, such that if w € Q* then 
- Sfa4}has full rank for a given monetary pol- 


‘see working paper [1995]). l 
ts, and imple computations reveal that, in 
prefere Mg scili,” 
when the a. rae Ee 
s volatili (Aly y’ = (A(p*)’A(r)A(p*)) 
cost of. cham». s 

$ aag ' w1 h 
natural d x A(p*) hen) wt), 
depends 

a 
ks do no 
as rules, cre wy 
risks DY fw o- (e-5) A(T) 
ne © XA(A'A(T)A) A’ A(T) 
slitical dei w fo 

rå a ` 
wore: A.W] +A(A'A(1)A)7! 
ng soc aoe wi, 
fficien A a(S ut) 
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where the prime ’ indicates transpose, A(7) is 
an S-dimensional diagonal matrix of probabil- 
ities and w; = Df < g w is the world aggregate 
endowment at date 1. Adding the first-order 
conditions, g = (c — x")'A(m)A, over indi- 
viduals, and using the equilibrium conditions, . 
>, x* = Ð, w”, implies that g* = [c — (w/ 
H)]'A(w)A(p*). This expression for equi- 
librium asset prices, the first-order conditions, 
and the budget constraints, in turn, yield [c — 
(w,)/H)}'A(m)A(p*) = (c — (w* + 
Ay"))’A(1)A. As, generically, A(p*) has full 
rank, A’AA is invertible, and y** = 
(A'AA)7'A’A[w,/H — wt]. Expression (A2) 
follows from (A1) and the budget constraints. 

Expression (A2) and the functional form of 
the utility function imply that equilibrium util- 
ity can be written as: 


u” = u(w") + e*'AA(A'AA)'A' Ae’, 


where e" = (w,/H) — wi}. Adding and sub- 
( 


tracting @*’AA(A’AA)~'A'Ae* and noticing 
that A‘Ae” = A‘ Ae” results in 


u" = u(w") + Z AA(A'AA)~'A’Ae* 
+e AA ((A' AA) 
— (A'AA)~')A*Ae" 
=i" + g" KA((A'AA)™! 
— (A'KA)7!)A'KS. 


Notice that if asset markets are complete then 
u” = a", Consider the term 


a ma u” 
= g" AA ((A' AA) — (A'AA) A Ae” 
= y*" (A' AA ĒD 
x (BA) = (A'A) ~) y**. 
Observe that (A’AA) — (A’AA) = (8), where 
X is a semipositive definite matrix with generic 
elements >, x(a)cov[1/;p, 1/,p|o]. It fol- 


lows that Ẹ* — u* is a positive semidefinite 
quadratic form and u" = i for all h. The 
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columns/rows of X corresponding to curren- 
cies with iP = ;p are zero. Let Ñ be the positive 
definite matrix that is obtained by deleting the 
Zeros from X. Then, the quadratic form can be 
written as 


where ¥*" is the equilibrium portfolio contain- 
ing only securities denominated in currencies 
with p+ pand-A is the corresponding positive 
definite submatrix of (A’AA)~'. Adapting an 
argument from step 3 in the proof of Theorem 
1 in Geanakoplos and Mas-Colell (1989), and 
being careful to keep the price levels in A(p*) 
constant when endowments are perturbed, it is 
possible to show that: if H, > 2 fori = 1,..., 
I, and J = S, then there exists a generic set of 
endowments, 2**, such that for w € Q** the 
matrix [y', ..., y’] has full rank, i.e., all the 
securities are traded (see working paper 
[1995]). It follows that if w € 9**, then 
y*" + 0 for some A. Therefore, if endowments 
belong to the generic set Q* N 2**, then 


u” — u” > 0 for some h. 
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The Ultimate Externality 


By Jon D. HARFORD* 


A unified formal theoretical framework for 
addressing externalities and population issues 
seems to have heretofore been lacking. Kerstin 
L. Kiessling and Hans Landberg (1994 p. 
xviii) clearly state that population growth has 
an external effect, but refer to no formal 
model. Partha Dasgupta (1993 pp. 350-51) 
argues informally (only) that childbearing cre- 
ates an external effect when common property 
resources are not properly priced. In a paper 
with results in the same spirit as those found 
here, John O. S. Kennedy (1995) analyzes the 
impact of immigration in the presence of neg- 
ative externalities and finds that the original 
population can be harmed by immigration 
even with externality taxation. 

The present paper addresses the issues of 
childbearing choices and externalities in a 
two-generation model of a competitive econ- 
omy with a pollution externality created by the 
aggregate consumption of one of two goods. 
For concreteness, the externality will be inter- 
preted as pollution, but other interpretations 
are possible. In fact, an interpretation based 
upon a public good externality, such as might 
result from educational activity, is entirely 
compatible with the mathematical model. 

It is assumed that the utility of each member 
of the final generation is a function of her own 
consumption and the level of pollution. Each 
member of the initial generation experiences 
utility based on her own consumption and pol- 
lution exposure, but each also derives utility 
from the utility of her children. The combi- 
nation of a narrow intergenerational concern 
and a public bad externality leads to the result 
that Pareto efficiency requires not only Pigo- 


* Department of Economics, Cleveland State Univer- 
sity, Cleveland, OH 44115. The author wishes to thank 
John O. S. Kennedy and his colleagues Myong Chang and 
David Yerger for feedback regarding the ideas in this pa- 
per. The author also thanks three referees for many de- 
tailed comments that helped shape the final version of this 
paper and eliminate errors. The author takes responsibility 
for any remaining errors. 
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vian pollution taxes, but a tax per child th 
of which approximately equals the p 
value of the pollution taxes each descen 
will pay. Intuitively, this result follows 
the fact that each member of the initial ‘px 
eration should pay a Pigovian tax for 


charged for the externality created bý, 
child. 


The present results can be better undergtòné Wik 


Daniel F. Spulber (1985) regarding the Pigs 
vian taxation of pollution from firms. S% 

taxation gives each existing firm the prope! 
centive to limit pollution, and efficiently lime 
the number of firms by raising the equilib 
average cost via the added tax burden. T 
authors point out that tax revenues mutt ti 
rebated in lump-sum fashion because reba 
to firms would inappropriately offset the eat}: 
limiting effects of pollution taxes. In the pi 
ent context, all individuals in the final pon 


ates no fiscal externality in her generatioa# 
offset the technological externality that # 
creates. Thus, in the absence of an approp 
childbearing tax, a member of the initi 
eration has an inadequate incentivė to 

‘‘entry’’? of her children into the.% 
generation.’ R. 


I. The Two-Period Model 


There are two generations: the genera 
the parents (generation 0) and the geat 


' Another perspective on the need for both P 
and childbearing taxes has been offered [by 2 ® 
Total pollution in the final generation is a pro% > 
population and per capita consumption of the 
good. To control both population and per capit@ 
tion one needs two instruments. 











































dren (generation 1). Each genera- 
lives one period and the periods do not 
<" The population of the parents’ gen- 
jg No and the population of the chil- 
neration is Mı. Each member i of 
jinn 0 produces Mo, children iii an asex- 


child ~aner so that each child has only one 
the Each member of the initial generation 
l F ena š same income and tastes, and each pro- 
ollow Aa ; 


children that are like all other children 
and equilibrium income. It is assumed 
-for ‘optimizing problems discussed result 
iS Casey sie interior solutions. All variables, including 
j ymber of children, will be treated as 


{eiir My, Ai], where ‘“‘c’’ and “*x” refer to 


( 1974 assumption goods that are produced in com- 
ling th tive markets. ‘‘A,”’ is interpreted as a 
1 firms,:8 e bad and will be called pollution. 
the pro 

ficient! 

re equil Mi 

burden, A= È, Xm 


nues mi g 
ecause FCO ; i aggregate consumption of good x 
iffset the €n a the final generation. 
es. In the ‘fis assumed that j will maximize her utility 
he final to a budget constraint of the form 
taxes eq fo 

bys + hj = Cy + (qi + hh )xy, 
ional c 
r gene the competitive supply price of 
nality ìà terms of the numeraire good c;; t 
fan appt t unit tax on good x, ; b, is a lump- 
the initi ate which will equal the equilibrium 
itive to efla amount of taxes paid (b+ = t,X,+); 
into j#8'a level of income that is exogenous 


€wpoint of person j, but partially 
ed by the bequest of capital left to j 
Parent i. The presence of tax-related 
1) and other equations is in antici- 
ee i Sgae policy role they will play. 
d the gen e of person j is Iy = ko(1 + 
po o Where w, is the competitive wage for 
f labor each person is endowed 


d sites tè bequest of capital received by 
is apr I, and y; is the competitive rate of 
set i : ae R generation 1’s time. Further- 


3 AKo ), where Ky is the total be- 
; her ny; children. 
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Given this budget constraint, a member j of 
generation 1 would maximize utility by satis- 
fying the marginal condition 


(2) (OU; / 3x OUy / cy) 


=q, +t, — (3U; / IAU y / cy). 


Equation (2) indicates that the marginal rate 
of substitution of c, for x, by j will equal the 
full price of x, in terms of c,, which includes 
the ‘‘nuisance’’ term on the far right. This 
term reflects how many units of good c that j 
would give up to reduce a unit of her pollution 
exposure that she influences by her consump- 
tion of good x. Equations (1) and (2) define 
the choices of x,; and c,; as functions of (J; + 
b,«) and t. 

The utility function of a member i of the 
initial generation is assumed to be 


(3) Vor = Uoilco:, Xoi, Ao] 


+ Bro Uyj[ey, Xy, Ar}, 


where Up; is a concave function of initial pe- 
riod consumption and pollution, Uj; is the util- 
ity of each of the children of person i of the 
initial generation, and 


No 


Ao C >, Xos- 


s=] 


Person i weights the sum of utilities experi- 
enced by her children by the generational dis- 
count factor 8, where 0 < B < 1. The 
definition of Vj; roughly captures the idea that 
a parent cares more about her children than 
about unrelated individuals. Except for the ex- 
ternality, the assumptions on Vp; are similar to 
those of Gary S. Becker and Robert J. Barro 
(1988)? 


.ewi > x 


2 Instead of the constant discount factor 6, Becker and 
Barro use the form a[n] = an~* with 0 < @ < 1 and0 < 
e < 1. Becker and Barro’s more general form implies the 
per-child generational discount factor shrinks as the num- 
ber of children increases but that a[n]n increases in n. 
The basic nature of the results of this paper would remain 
unchanged for any discount factor function with n as its 
sole argument as long as it produced an interior solution 
to the optimizing problem. 
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The income J; of a member i of generation 
0 is considered a parameter that has the value 
Ty for all i. The budget constraint for i will be 
written 


(4) bo» + Io; = Co + (qo + to) Xoi 


+(p+T)no; + Ko, -- 


where go is the competitive price of good x in 
terms of numeraire good co; p is the constant 
cost per child; Ko; is the total cost of all be- 
quests to her children; fy is a constant unit tax 
on good xo; and T is a constant tax per child. 
Finally, box = foXos + Thos, where Xox and Nox 
refer to the common equilibrium values for xo 


and no, is the per capita lump-sum rebate of- 


tax collections. 

Maximization of Vo; by the choices of co; , 
Xoi» Moi, and Ko; subject to (1) and (4) yields 
the following marginal conditions: 


(5a) (Uo; / xoi) / (OU; / Aco ) 
= Go + bo 


a (OUo; / Ao) / (Uoi / cu), 


(5b) (OU; / Oc) / (OU; / Aco; ) 


=1/{80 +r)}, 


(5c) p+T 

= {Uy / (Uo: / Aco: )} 
= {(OUi;/ Bey) / (Uo: / Icoi) } 
x (1 + 71) Bo; 
= Pno: { (OU, / ðA) 

+ (Uo; / Aco; ) } xy 

= {BU / (Uoi / 3c) } — kor 

— Brig { (OU; / AA;) 


+ (OU; / Ico) } xij- 
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Equation (5a) has an obvious intep% 
analogous to (2). Equation (5b) indic 

(at the margin) the number of units ofh 
sumption she will give up to increase 
her children’s consumption by one unit; 
versely related to the product of the ga 
tional discount factor and the gross rag 
return on capital, Equality (Sb) was Usea 
obtain the final version of the right-han 
pression in (5c). A rearrangement of (50,23 
dicates that the utility-maximizing numbe; a 

children will be such that the full private je! 
of a child (p + T + ko) times the mar 
utility of income to the parent will equal 
increase in utility derived from having 
more child endowed with the same reso 
as the others. The increase in utility from; 
ing one more child is slightly reduced by 
far-right term in (5c), which reflects 
impact that one more child. will have on 
siblings via the increased pollution she cr 


Il. Pareto-Efficient Resource Allocation 


It is assumed that for each generation th 
is a separate all-knowing, costlessly acti KA 
planner who has the goal of achieving P 
efficiency from the viewpoint of members: 
her generation. The rationale for assumitgé g 
separate planner for each generation is that ths 
more accurately reflects the realities of poli: 
making.’ Each planner takes the other's 
tions as given. In generation 1, Pa 
efficiency will-be produced by maximizin 
utility of the ‘‘representative individ 
whose choices are notationally reflected ia 
dropping of individual subscripts. In effe 


*Two virtually indisputable assumptions lead OFS 
the conclusion that the only welfare functions ey 
implications of which have any hope of becoming! 
(except by accident) are those which are a function 
preferences of members of the current generation. 1 
assumption is that individuals act according t0 = y 
preferences. (This is really just a definition off n 
ences.) The second assumption is that only the 4% ¢ 
directly affect policy.choices made in their gen4% 
other words, the dead and the unborn do not vole i 
biological viewpoint, this author would presume 
have altruistic preferences towards our children a 
because they do not have the power to create ane’ 
themselves until they become members of the ” 
tal generation. 






















chooses the common values of c, and 
to the per capita income constraint 
ko +w = C1 + qX. The planner 


qa Maximizing U,[ci, xı, A] subject 


ne 3 ; Y.. 4 i 
= Nyx me constraint yields the efficiency 


grossifa 
ight- iani [ƏV / cı) 

nt of (Sey e / dx) ee) i l 
i ori > qı — N,(OU; / 8A,)/(OU, / de,), 

i the; : 
will {is interpreted as both supply price 
n hav ‘Sauilibrium rate of product transforma- 
ame (7 Sea The marginal rate of substitution between 
ility fro E -Sande displayed in (2) will equal that re- 
duced d for Pareto efficiency as indicated in (6) 
h reflek ; 


@valasted at the efficient allocation. The right- 


renerati alde represents the negative of the sum 
istlessl inal rates of substitution of the numer- 
chieving:Pa goed for pollution for the (N, — 1) 


iber? members of generation 1. This is the 
x :. €aditional Pigovian tax. 

| Favet, efficiency for the initial generation 
hieved if the representative individ- 
€$ a maximum level of Vp with re- 


tion Í tê choices of the common values of 
1 maximi? and K, subject to the per capita in- 
ve ind traints on generation 1 and on gen- 
y refle th..the latter being I = cy + 
ipts. In + Ko. Thus, the initial-period plan- 


gnize the impact of x) on Ap = 
> Impact of no on A, through the 
DA, = NX, = Nomox,. Assuming 
Satisfied, the necessary conditions 
fficiency for the initial generation 


nt generally : 
ccording BAOn )(BUy to) 
hat only 3 
K T No(OUp / Ao) / (3U / co), 


U, /c1)I(BUs / eo) 
{8 + r) ie 
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the impact of x, on the value of- 


(8c) p= {BU, / (OUp / deo) } — ko 
+ BnoNox:{(OU, / 0A1) 
F (OUp / ðco)}. 


Comparing equation (8a) to (5a), it follows 
that individual trade-offs can be made consis- 
tent with efficient trade-offs between the 
goods c and x in period 0 by setting to = — (No 
E 1)(3Uo/3Ao)/(3U0/co). 

Equation (8b) is identical to (5b) except for 
the individual subscript, indicating that the un- 
modified individual trade-off between current 
consumption and total bequest is. consistent 
with efficiency, as long- as tf, is determined by 
(7). This result is more interesting than the 
reader might suspect. Martin L. Weitzman 
(1994) used a bare-bones model of a dynamic 
economy to argue that pollution externalities 
imply a social discount rate that is below the 
unadjusted rate of return on capital due to what 
he called ‘‘environmental drag.’’ The present 
model shows no such divergence between the 
unadjusted rate of return on capital and the so- 
cial discount rate. However, Harford (1996) 
has shown that results similar in spirit to 
Weitzman’s, but significantly different in de- 
tail, occur when the present type of model is 
modified so that the production of the capital 
good is polluting.* Thus, the present results in- 
dicate that the social discount rate will not dif- 
fer from the unadjusted rate of return when 
pollution results from producing a noncapital 
good, while Harford’s result confirms that 
‘“‘environmental drag’’ will modify the dis- 
count rate when production of the capital good 
is polluting: 

In equation (8c) I have used (8b) to re- 
duce one term to ky. A comparison of (5c) 
with (8c) indicates that individual utility 


-x -oar ee 


4 Weitzman’s model has only one good and no explicit 
utility function. His expression for the social discount rate 
is algebraically and conceptually more complicated than 
Harford’s (1996). The reader is referred to equations (17) 
and (18) in Weitzman (1994 p. 205). In the Harford 
model the social discount rate becomes (1 — t)r, where r 
is the unadjusted rate of return on capital and ¢ is the pol- 


lution tax rate as applied to the appropriate generation. 
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maximization can be consistent with Pareto ef- 
ficiency only if 


(9) T= —Bno(No — 1)m- 


x { (OU, / 8A,)/(OUp / 3ca) } 
os SSA ttn Cloke r). 


X-{(N; — mo)/(N, — 1)}, 


where the last expression on the right results 
from using (8b). 

As (9) indicates, the childbearing tax on the 
parent should equal in value the present value 
of the pollution taxes that a child would pay 
in her lifetime times a factor that will equal 
one if parents each have one child. The factor 
is less than-one if each parent has more than 
one child since the parent internalizes the. ef- 
fect on siblings of pollution from each child. 

The first equality in equation (9) has an in- 
tuitive interpretation. If another unit of pollu- 
tion is created in the final generation, each 
parent in the initial generation will suffer a loss 
equal to — Bg { (OU;/0A,)/(OU5/ Oc) } when 
evaluated in terms of the numeraire good. 
Therefore, a parent giving birth to another 
child producing additional pollution x,» im- 
poses a loss on (No — 1) other parents equal 
to — Bno (No = 1)xı» { (OU,/0A, y/( OU / 
Oco) } . The childbearing tax thus prices the ex- 
ternal cost a child imposes on other parents via 
the external cost a child imposes on their 
children.” 

The model can be readily extended to many 
generations. In such a model the extended util- 
ity function of a member of generation 0 
would contain a term for each future genera- 
tion g of the form (8 non = ng -U;), where 
the pollution argument of U, will have the 
form A, = Nonon * Ng — 1X,. It has been shown 
by Harford (1996) that the total Pigovian 
childbearing taxes on person i of generation 0 
would equal the discounted present value of 


sg 


5 Leslie A. Whittington et al. (1990) found that the per- 
sonal exemption for dependents has had a positive and 
significant effect on the national birth rate of the United 
States. This suggests that a childbearing tax would have a 
significant impact on such choices. 
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the pollution taxes paid by all des 
i in all generations. 


I. Concluding Remarks 


With a mixture of positive and ne 
ternalities in a two-generation mode} 
that the childbearing tax. would appro: 
equal the per capita net present value ofr 
ovian taxes and subsidies paid by one’; 
Among economists and others, there; 
agreement about whether population 
a ‘‘problem,’’ which in present terms 
interpreted as a disagreement about the 


existence of costly and imperfect infor 
the same problems that arise in the imp] 
tation of virtually any policies intended gg BM 
fect childbearing or other externalities” Wii 
the present paper ignores these kinds of 
tial complications, it has the virtue of alloj 
both childbearing and externality-genénta 
consumption choices to be endogenous #3! 
model where the disproportionate concen é # 
parents for their own children is recognizéé.' 
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The Impact of Educational Standards on the Level and 
Distribution of Earnings . 


By JULIAN R. BETTS* 


In recent years, the role-of standards in im- 


proving the quality of public education has 
received considerable attention from econo- 
mists. Recent examples include Suk Kang 
(1985), William Becker and Sherwin Rosen 
(1990), and Robert M. Costrell (1994).' 
These papers are important in the sense that 
they view education as a principal-agent prob- 
lem. This contrasts with the main paradigm for 
empirical work on school quality, which at- 
tempts to estimate a ‘‘production function,”’ 
in which purchased inputs such as teachers and 
length of the school year are assumed to add 
value to the ‘‘intermediate input,” students.” 
The theoretical literature on standards points 
out that academic achievement may have as 
much to do with incentives as with measures 
of school resources such as spending per pupil. 

An implication: of the existing theoretical 
models of educational standards is that an 
egalitarian planner may prefer lower standards 


* Department of Economics, University of California- 
San Diego, La Jolla, CA 92093. This research was sup- 
ported by a grant from the American Educational Research 
Association which receives funds for its ‘AERA Grants 
Program’’ from the National Science Foundation and the 
National Center for Education Statistics (U.S. Department 
of Education) under NSF Grant #RED-9452861. Opinions 
reflect those of the author and do not necessarily reflect 
those of the granting agencies. I also thank UCSD for re- 
search support for initial phases of this work. I thank 
Robert Costrell, Vincent Crawford, and Joel Sobel for 
helpful suggestions. 

' A related contribution by Andrew Weiss (1983) care- 
fully documents that education can act both as a sorting 
mechanism and a source of human capital if students are 
passed or failed based on a test of achievement. The paper 
by Weiss does not focus on the impact of a change in 
educational standards per se. For related work on how 
firms may use apprenticeships or other ‘‘tests’’ to identify 
the best workers, see J. Luis Guasch and Weiss (1980, 
1982). 

. ? See Eric A. Hanushek (1986, 1989, 1991, 1996) for 

excellent reviews of the literature on education production 
functions. For recent reviews on the relation between 
school resources and earnings, see Betts (1996) and David 
Card and Alan B. Krueger (1996). 


than an income-maximizing planner, j 
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an increase in standards will aggray 
equality. This point is made most Clearly 
thoroughly by Costrell (1994) who iiss 
Kang (1985) and Becker and Rosen (19855 
models earnings as an endogenous functiog 
educational achievement. An increase jp’ 
cational standards raises wages fof 
workers who choose to meet the stay 
while leaving the wages of those who 
meet the standard unchanged. Costrelf (IPE 
p. 960) concludes that ‘‘As semen x 


ent finding: an eae ea wake A 
prefer higher standards than would a pa 


of earnings. The result is based on the ob 
vation that if workers are differentiated 
respect to ability, an increase in educa 
standards will increase the earnings of bob 
most-able and the Jeast-able workers. Thei 
workers whose earnings fall are those wos 
who after the increase fail to continue n 
the standard. 
Section I develops the model. The oe 

parallel in the literature is Costrell (19% 
The key distinction between the present 

and Costrell’s is that the present moda è 
sumes that students differ in ability. Hi 
Costrell model, all students are assum i 
equally productive if they exert zero eff 
students do differ in preferences. 
developed here is similar to the MO ii 
Kang (1985) and Becker and Rosen (17253 
that it assumes. heterogeneous abiliti 
stead of assuming that the ‘“‘rewards’ 
to meeting the educational standard af 
as do these two papers, the present P 
that of Costrell, allows the resulting 3 
be an endogenous function of the P 
ties of workers who select into ea". 






























eful extension since it uses a stan- 
-function to explain why students 
about their performance in school, 
vi fecourse tO ad hoc assumptions about 
wing the educational standard affects 


I. The Model 


“Pegaider a hybrid human capital/signaling 
st clea a ‘et, which attendance at school adds to 

Cari yra activity. Firms can observe the standard 
ach student meets. In particular, they 
tify those who graduate from high 
d those who do not. This information 


eiad 


‘student has an initial level of achieve- 


sa f òr ability) of a, where a is a random 
itari oe yanáble distributed on (a, a] with probability 
the gradi f- dxsity and cumulative distribution functions 
g an imponi jis) and F(a), respectively. Student i maxi- 


utility, which depends on leisure (L,) 
k: ged lifetime carnings (w;): 


W= UL, w:) 


imize subject to L; € [0, L]. 

1 on the 7 , , 

srentiated a ational achievement, 7, is defined by 
wn edt ed tional production function g, which 
ngs of bot] on both ability and effort. The value 
rkers. 1 product of a worker of ability a; who 
» those en a level Li of leisure is directly pro- 
ntinue educational achievement, with a 


€.value marginal product of one 
cational achievement: 


ie 


St. g1 > 0, g: >0, 2, <0, 


82 = 0, 22 = Oanda >Q. 


Bete that. the first argument of 2; L > L,, 
SStudent effort. The school sets a stan- 


rosen ( ; Mp 

‘bilities BARE : minimum level of achievement 
EF +» Pass, which is denoted 7,,. 

dard aa cannot observe the productivity of 


: Rais rectly. But firms do observe 
sulting SMA Pell T. eo has met the standard im- 
f S Sage Sool, and thus they infer his 
Productivity, Perfect competition in 
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the labor market ensures that workers are paid 
tbeir expected product conditional on whether 
they have met the standard. Workers belong 
either to group 1, which consists of students 
who have met or exceed the standard Ts, OF 
group 2, which Consists of workers who have 
not met the standard. Wages will be constant 
within each group due to the ‘unobservability 
of ability. Conditional on reaching the 
achievement level needed to belong to group 
j, j = 1, 2, the worker optimizes by choosing 
the maximum level of leisure possible. 


RESULT 1: If some students choose to reach 
the standard and others do not, then there ex- 
ists an ability level a* such that workers 
choose group (1/2) as a 2 a*. For workers 
in group 2, who do not meet the standard, 
L; = L. For x, < g(0, @), the educational 
achievement of students is given by the follow- 
ing equation: 


8(0, a;) < T, ifa; <a* 


Ts if ai Slat iat] 
8(0, a) > x, 
if a; E€ (a**, a], 


where a** is implicitly defined by g(0, a**) = 
Ts, and a* is the ability level at which a 
worker is indifferent between belonging to 
groups 1 or 2.° 


PROOF: 

For all workers in group 2, wages and lei- 
sure are identical at values of aE(ar|L = L) 
and L. Utility is thus independent of ability in 
group 2. But utility is nonstrictly increasing in 
ability if a worker chooses group 1, the mem- 
bers of which must achieve or surpass the stan- 
dard 7,: 


a 


* If standards are high enough, so that r, = (0, a), 
there will be no students who exceed the standard. In this 
case, the equations in the model can be simplified by set- 
ting a** = @. All of the results in the model continue to _ 
hold in this simplified case. 
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au 
da 


dL 
(4) -F( 


~ aL \da 





r= ) 


>0 fora € [a, a**) 





NE Ts 


=0 for a = a**. 


Thus if workers of ability a* are indifferent 
between the two groups, all those of (lower/ 
higher) ability will strictly prefer group (2/1). 
The remaining statements follow directly. 


The equations which follow assume that the 
standard is nontrivial, in that at least some of 
the students who meet the standard must exert 
positive effort to do so: a* < a**. 

The expected productivity, and hence the 
wage, for workers in each group can now be 
calculated: 


(5) w! = a@E(n; |; = 7) 


a* 


f ar,f(a)da+ | ag(0,a)f(a)da 


1 — F(a*) 


and for workers who do not meet the standard, 


(6) w? = aE(n;|7; < T) 


a f g(0, a) f(a) da 
F(a*) ‘ 


where all workers optimize by setting L; = L. 
What happens when schools raise the stan- 

dard 7? For students who continue to meet or 

exceed the standard, their average level of 

achievement is likely to rise, and hence so will 

their wage: 

ay (2) =e - F@*)) 

dr, 1 — F(a*) 





a>a* 


The first term is positive. It is straightfor- 
ward to show that dw'/da* = 0. The sign of 



















da*/dr, is likely to be positive, as wij} 
cussed below. l 

For students of lower ability, the è 
higher standards depends crucially 9 
of da*/dr,. As a* increases, the avertips 4 
ity of workers in the bottom gro 
thereby increasing the wage paid to this 6 
More formally, differentiate (6) with 
to 7,: : 





a<a* 


_ dw? da* _ f(a*) 
“da* dr, F(a*) 





(0, a) f(a) da 

x } g(0,a*) Fat) oa 
da* 
aa 


The term in braces is the gap between 
ductivity of a worker of ability a* and thé; 
erage productivity of workers of ability b 
a*, subject to all such workers choosing; 
L. Since this term is positive (i.e. df 
da* > 0), the wage paid those in the [et 
group rises or falls with a rise in school s# 
dards as (da*/dx,) 2 0. 
I will make the reasonable assumptiof: 
after an increase in educational stan 
fewer students will meet the standard. Int 
words, da*/dz, > 0. This will in fi 
shown to hold for any stable equilib 
First, let a,, denote the ability level of the 
ginal worker (in equilibrium or out of 
rium). Denote the marginal worker's 1 


where L*(a,,, Tms) is implicitly define et 
Ts = g(L — L*, am). In equilibnuf 


tI thank Robert Costrell for suggesting tis E 
argument. a 






















worker is indifferent, so A(a*, 
The comparative static derivative is 
i —(OA/8n,)/(OA/0a,,) at a, = 


unambiguously sign 0A/Or, < 0. 
illustrates, showing the indifference 


a of ability a*, who is originally indif- 
een the pairs (L, w°) and (L*(a*, 
$5 Consider the effect of an increase 


paid to those meeting the standard. 
an of ability a* will lower his utility (to 
indifference curve ) if he chooses to 
the new standard. (A leftward move 
giang ;the production possibilities frontier 
= ees him onto a lower indifference curve.) 
ity is now invoked to obtain the sign 
A/Oa,,. Suppose that the margin am varies 
tantaneously in response to the utility 
meeting the standard: 


bf ability bees 
| choosing gm: scat‘decline as an rises. Applying the chain 
am zc. the condition is 9’[0](OA/da,,) < 0. It 
- sgemedlately follows that at any stable equilib- 

p- Sein, GAN ĝa > 0, and thus da*/dz, > 0. 
h the educational achievement of 


eee The impact of a rise in x, on a* 
mae y ambiguous, but in any stable equi- 

sS ldr, > 0. In this case, if the ed- 
Standard is increased, educational 


iy 2X E s x £ 
M a Make the relationship between the standard 
ave clear, in the figure I assume that no 
eS dard, so that w’ = ar,. This is not 
A p results to be true. 
ersion of Costrell (1994) makes a 
ell (1993 footnote 9). 


sd production possibilities frontier for ` 


dard and the accompanying increase _ 
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L* (a*r) L 


Leisure 


FIGURE 1. OPTIMAL CHOICE OF WAGE AND LEISURE FOR 
WORKER INDIFFERENT BETWEEN MEETING AND NOT 
MEETING THE STANDARD 


Note: The figure assumes that no worker exceeds the stan- 
dard, so that w' = ar,. 


the standard with zero effort, their achieve- 
ment will not change after a change in the 
standard. After the increase in the standard, 
achievement falls for the infinitesimal mass of 
workers at the margin, and remains constant 
for workers who before and after the increase 
in standards choose group 2. The most impor- 
tant conclusion is that a marginal increase in 
educational standards increases the wages of 
the workers of highest and lowest ability. The 
only workers whose wages drop are the work- 


- ers on the margin who no longer meet the stan- 


dard. Although of infinitesimal mass, these 
marginal workers suffer discrete wage drops, 
while all other workers experience marginal 
wage gains. i ee 


II. Implications 
A. Solving the Planner’s Problem 


The main result of the paper is that a social 
planner who values equity may set higher 
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standards than a social planner whose goal is 
simply to maximize the sum of earnings. This 
differs from the findings of Costrell (1994). It 
derives from the earlier result that in a model 
with heterogeneous but imperfectly observ- 
able abilities, the wages of both the most able 
and the Jeast able will rise after an increase in 
the educational standard. Following Costrell, 
assume that the social planner does not value 
the leisure of students, and instead seeks to 
maximize some positive function of students’ 
earnings after they leave school. 


RESULT 3: Consider an egalitarian planner 
who maximizes the sum of a concave function 
of workers’ incomes: VE = F(a*)h{w?(a*)] + 
(1 — F(a*))h[w'(a*, 2,)], where dh(y)/ 
dy > 0 and d*h/dy* < 0. This planner may 
choose a standard higher or lower than would 
the income-maximizing planner. 


PROOF: 
See the Appendix. 


Thus in a model with heterogeneous abili- 
ties, we cannot assume that concerns about in- 
equality should oblige policy makers to set 
lower standards. A more egalitarian planner 
may prefer a higher standard because it leads 
to higher average ability in the bottom group 
of students, and thus higher wages for the less- 
able students, a benefit which is more highly 
valued by the egalitarian policy maker. As an- 
other way of seeing this, note that if a Rawl- 
sian planner were given a choice between two 
educational standards, both of which entailed 
some students meeting the standard, the plan- 
ner would always prefer the higher standard 
because it is closer to the pooling equilibrium 
which will maximize the earnings of the least- 
able worker. The model of Costrell does not 
come to the same conclusion because in that 
model all workers who fail to meet the edu- 
cational standard are equally productive in the 
labor market, and so the wage paid to this 
group of workers is independent of the group’s 
membership.’ 


7 Note that in the proof of Result 3, the sign of dV®/ 
dx, when evaluated at the income-maximizing planner’s 
optimal standard is ambiguous precisely because the wage 
of the less-able group, w°, varies with the standard. This 


-model which illustrates that it is in 


utility function and the educational p 


B. An Example 






















Consider a simple example of th, 


sible that a more egalitarian planner wij’ 
higher educational standards. Assume th 


function take the forms: 


(1) U= (L) w) 
subject to L; € [0, 1] 
(2°) VMP: = īū; 


assumptions, it is easy to calculate wages f 
both groups of students and a* as a fune 
of the standard 7,. For all nonnegative vals 
of a*, it can be shown that da*/dn, > 0.5 
The planner’s objective function is aa! 
elastic function of each worker’s income: 








that ever 

2\2 

(11) v= Fax) I 
(w!)? ie 
+ (1 — F(a*)) a or everyt 


The planner’s elasticity of substituti 
tween the income of any two workers 18 Bg closer to t 
by 1/1 — z. Thus the income-maxill 
planner has z = 1. Egalitarian planners s 
z < 1. The lower is z, the more egalitan x Sahi le c 


the planner, and as z > —% we appro ea 

case of the Rawlsian planner for whom 5'e. SEa Ad ba 

welfare depends only on the income of re nt.] 

son with the lowest income in the popty ` inte 
The planner’s problem was solved SER 

cally for various values of k; where k¢ aa the p 


will not occur if all members of the bottom 5” 
equally productive. l 

* Note that the educational production func 
sen so that productivity of the lowest-ability W% g 
be positive even if he exerts no effort while 18 


-n 












































tance of ability relative to effort in 
educational achievement. The re- 


of theg eran in Figure 2. For each value of k, 
is inds co onal standard is normalized to 1 at 
ner will -facilitate graphing.” 

Ssume: 1 and k = 10, the more egalitarian 


er (the lower is z), the higher he 
educational standard. The effect is 
for the case k = 10, where the vari- 
ducational achievement due to innate 
latively high. In this case, a mod- 
galitarian planner (with z of approx- 
:0.745) reaches a corner solution, 
ran educational standard so high that 
In this way he achieves complete pool- 
by raising the earnings of the less- 

‘students. The egalitarian planner favors 
yely large increases in w° that result 


1 z 


: perf ore-able workers dropping into the 
ational sf students who do not meet the standard. 
t abil “Fee (fie case k = 1, where ability does not con- 
1] Un bute as strongly to achievement, it requires 
ulate ‘alightly more egalitarian planner to set the 


ird so high that nobody passes it. 
3 that a planner who is so egalitarian as to 
the pooling outcome will be indifferent as 
"E Silicther we achieve this outcome by setting 
“§-Seadard so high that nobody can attain it, or 
‘ew -that everybody can attain or exceed it 
Tero ‘effort. In other words, there are two 
y Rabson to the planner’s problem whenever the 
outcome is for all students to be placed 
te Sse group: either everybody is allowed 
everybody is made to fail. Although 
these extremes is particularly realistic, 


substitutloy à Case to be made that the former situ- 
worke: Closer to the status quo in American pub- 
ome-m is. [See for instance Lorrie A. Shepard 
an pl ‘Lee Smith (1990), who estimate that 
ore egali le of 13 states and the District of 
we app the average proportion of students 
for who d back a grade in a given year is 


Percent] 

se Interesting case is k = 0.5, where 
gee ange in which it is true that the more 
4 anner, the lower the planner 


Goes not Convey information about the rel- 

-atear optimal standards for different values 

formalization used. As one might expect, 

a; ses With k. For z = 1, the optimal 
Id between k = N S and L = 1N 
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1.2 


1.15 


po 
. 
pami 


1.05 


Normalized Education Standard 


0.95 





——K=0.5 — - K=1 - - - K=10 





FIGURE 2, OPTIMAL EDUCATIONAL STANDARD VS. 
PLANNER’S PREFERENCE PARAMETER Z 


Notes: Optimal standard is normalized to 1 for z = 1. 
K = ability coefficient in educational production function. 


should set the standard. For this level of k, the 
differences in productivity by ability are so 
small that the mildly egalitarian planner sets 
educational standards lower than does the 
income-maximizing planner, to increase the 
proportion of the stúdents who meet the stan- 
dard. The losses in w? that result from the drop 
in average ability in group 2 are minor evén 
from the point of view of this egalitarian plan- 
ner. But as in the other two cases, there comes 
a point beyond which increasing the degree of 
egalitarianism requires the planner to set 
higher standards in order to guarantee higher 
wages for the least able in the population. 
This illustrates the result that regardless of 
the relative importance of effort and ability 


in nronducina academic achievement as the 


ood 


’ 
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planner’s preferences tend toward those of the 
Rawlsian planner (z > —), the planner will 
begin to raise, not lower, standards. 


C. Extension of the Social Welfare Function 


Result 3 assumes that planners are con- 
cerned with the earnings of students rather 
than with their utility. The next result indicates 
that highly similar intuition obtains for plan- 
ners who maximize a function of individual 
utilities. In addition, a simple sufficient con- 
dition emerges under which egalitarian plan- 
ners will set higher standards than would a 
planner who weights all workers equally. 


RESULT 4: Consider an egalitarian planner 
who maximizes a concave function of students’ 
utility functions, J? h(U(L, w; a)) f(a) 
da. This egalitarian planner may set a higher 
or lower educational standard than the 
“‘utility-maximizing’’ planner who maximizes 
J? UCL, w; a) f(a) da. Assume an interior 
solution for the utility-maximizing planner, so 
that the first-order condition is met. Then a 
sufficient but not necessary condition for the 
egalitarian planner to set a higher standard is 
as follows: at the optimal standard for the 
utility-maximizing planner, for all students of 
ability a > a*, where a* is the ability of the 
marginal worker at the optimal standard for 
the utility-maximizing planner, (dU(L*(a, 
T), w')ldx,) < 0. Here L*(a, 1,) is implic- 
itly defined by 7, = g(L — L*, a), i.e., L* is 
the maximum amount of leisure consistent with 
a person of ability a reaching the standard r,. 


PROOF: 
See the Appendix. 


The intuition for the result that an egalitarian 
planner may set higher standards is highly sim- 
ilar to that for Result 3. The least-able workers 
see a rise in utility when the standard is raised, 
since their wages rise without a concomitant in- 
crease in effort. Wages rise for the bottom group 
because of the assumption of heterogeneous pro- 


‘“ ductivity among workers who exert no effort. In 


contrast, utility must fall with an increase in the 
standard for those with ability just above a*, 
since in the relevant region for these workers, 
the indifference curves will cut the production 
possibilities frontier from above, as in Figure 1. 
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‘that all workers above a* (or riéarly all) 































(Note that for the indifferent worker witha 
a*, the optimal standard would be at jays 
of tangency between the production po; 
frontier and the indifference curve, whig} 
be at a lower standard than the current 


lower indifference curve. Thus it is co 


see their utility fall as the standard is 
pecially if a* is near @. The egalitarian ņ 
would then set a higher standard than the ua 
maximizing planner since the former diseiya 
the utility losses to the most able while weig 
heavily the utility gains that accrue to the 
able as the standard is increased.'°"! 


IO. Concluding Remarks 


The central finding of the literature on ed! 
cational standards is that higher standai %4 
crease the achievement of only the mogak 
students (or those with the lowest rate | 
count). Thus, the literature argues, a po 


want to set relatively low educational standi? 
The central finding of this paper is thd: $ 
workers differ in ability, and if firms cas 
fully perceive individual workers’ producti 
ity, then an egalitarian policy maker might: 
an educational staridard which is higher. 
that set by an income-maximizing planned 
This finding derives from the observation 
if workers vary in ability, then an incremés 
increase in standards will increase the wage 


a 
eo tk os eae gr pe 
Dy Oe ee edi Oe epee 


10 Eor a worker with much higher ability that 
becomes possible that the tangency might lie @ 
current standard. Such a worker will see an 1 
utility if the standard is raised. On a related note, [™s è% 
of Result 4 in the Appendix shows that the sulle 
not necessary condition for the egalitarian plane is 
a higher standard requires that there must be 00*" a 
above ability a**. That is, all students must exe 
effort to achieve the standard. Otherwise thes?" 
would see an increase in earnings and no increase 
and their utility would rise with the standard. 

1 In Costrell’s model, the egalitarian plante 
lower standards since w? is not a function of whe 
out.” But Section U, subsection B, of his pap 
an interesting argument which suggests that an ©b-~ 
planner might set a higher standard if he puts less | 
on those students who are least willing to W 
cause of ‘‘their greater capacity to enjoy leisut 


i 


Zoe EPS. 



























the most-able workers but also those of 
ble workers. Those workers who do 
the educational standard benefit from 
-z> in the standard: as the pool of work- 


w 
nt sta do not graduate grows, firms correctly 
ion pos ee the average quality of this group to be 
kers¥oe and adjust wages upward. 


life, this idea may have some relevance 
stigma attached to dropping out of 
bably smaller in societies where 
w people graduate from school. The 
js smaller because firms realize that even 
ble workers may fail to graduate if stan- 
set too high relative to the resources 
in education by schools and families. 
“policy of higher educational standards will 
jnerease the earnings of all workers, given 
s‘some students on the margin will reduce 
ir effort and their earnings. But the ma- 
of.students, who lie both above and be- 
‘the critical ability level, will see increases 
‘thelr earnings. Such considerations suggest 
š In the case of educational standards, the 
&42-off between equity and efficiency may not 
severe as commonly thought. 
course, in real life a policy maker with egal- 
views may not realize the benefits that ac- 
j weaker students from the tightening of 
: The policy maker might mistakenly fo- 
Sæ on the reduction in utility to the marginal stu- 
f- ihe standard is raised, without paying 
fit‘attention to the benefits gained by the 
e-students. In such a case, the policy 
‘May set standards which are too low. 


APPENDIX 


YEOOF OF RESULT 3: 


ibe result follows by evaluating the ega- 
lanner’s first-order condition at 


wi 


4 
w 


Tj da* 
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the value of m, which solves the income- 
maximizing planner’s problem, 7,. First, solve 
the income-maximizing planner’s problem: 


__ max V'= F(a*)(w?) + (1 — F(a*)) (w'). 


The first-order condition implicitly defines 7,: 


da* 
an, 





f(a*) (w?(a*) — w'(a*, m,)) 


dw?(a*) da* 
+ Fla") ae ET 
dw'(a*, Tm.) _ 


+ (1 -— F(a*)) PE 


0, 

where to simplify notation a* is implicitly 
evaluated at 7,. Next, evaluate the left-hand 
side of the first-order condition for the egali- 


tarian planner at 7, and the corresponding 
value of a*: 


d 


a* 
dn, 





f(a*) (h(w2(a*)) — h(w!(a*, m,))) 


dw?(a*) da* 


EGE CG) 





t (1 = F(a*))h'(w'(a*, Ts)) 


dw'(a*, T,) 
dr, ‘ 


Substituting for (1 — F(a*)) from the first- 
order condition for the income-maximizing 
planner’s problem, and rearranging, 


[h'(w?(a*)) —h’(w'(a*, 1,))] BO. 
De 


+ 


ine 





+ 
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The signs of the indicated parts of the equation 
are based on the facts that his a concave func- 
tion and that w? < w’. Thus the optimal ed- 
ucational standard for the egalitarian planner 
may be above or below tt. 


PROOF OF RESULT 4: 


__ The first-order condition, for the planner 
who maximizes the sum of utilities is: 


dv’ — dw? da* 
SiS * ye Oe 
Ti F(a*)U,(L, w Yat i 


“* dU(L*(a, #,), w! 
F f EO D tajaa 


a* 


dU(L, w')dw' 
dw! ig, lT EC 


= 0. 


In the above equation, the optimal standard 
for this planner is denoted by #,. For the egal- 


itarian planner the derivative of the social wel- 
fare function is: -> ii 


E 


ALSO Ee 
ie = F(a*)h'(U(L, w?))U,(Z,.w?) 





dw? da* 
da* dr, 





+f h'(U(L*(a, x), w')) 


adU(L*(a, x), w! 
Or 


zy 7 1 
+h UE, w) Tewi dw 
w T 


y 


x {l- F(a**)], 


>.“ 


Evaluating the latter at the values of a* and x. 
which maximize V”, it is found that the egal- 
itarian planner may choose a standard which 
is higher or lower than the optimal solution for 
the nonegalitarian planner (7,): 
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za f [A'(U(L*(a, z), wy} 


a* 





dr, 


Ñs 


— h'(U(L, w?))] 


dU(L*(a, 7,), w') 
x dr, Fai 3 


+ [h'(UCZ, w')) 
—h'(U(L, w?))] 


adU(L, w') dw! 
2 dw' dr, U1 -Fla 


= 
< 


The last term is negative, by concavity 
ambiguity arises because for workers 


workers of ability just above a*, utility 
clines with a rise in 2,. Thus, the first inté 
could be positive or negative due to ambigay' 
of the dU(L*, w')/dr, term. ; 
A sufficient but not necessary condition i. 4 
the egalitarian planner to set a higher standad 


dU(L*, w')/dx, < 0. Note that this con 
requires that there must be no workers 
ability a**, since such workers would al 
gain in utility from a marginal increase i 
as their leisure would remain unchanged’ 
their wage would rise. Thus under the 


equation vanishes. Under the sufficient bala 
necessary condition, then, dV/dr|s, >? 
the above argument, and by the concavily 
h and the fact that U(L*, w') > U(L, w} 
all a > a*, which together ensure that the 
term in the above equation is positive. 
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Competition over More Than One Prize 


By DEREK J. CLARK AND CHRISTIAN Rus* 


In many situations, competition may be 


used as a means for providing incentives: Sup- ~~ 


pose that a firm is restricted to pay the same 
wage to all-workers who have the same job (as 
is broadly the case for lecturers at Norwegian 
universities, for example). This situation may 
arise due to a union-imposed restriction, or it 
may be voluntary.' In order to provide an in- 
centive to make the workers exert extra effort 
without breaking the uniform wage restriction, 
the firm may instigate a contest among work- 
ers with in-kind prizes such as paid leave, a 


family holiday, or promotion.” Those who ex-. 


ert the most effort may, on the basis of past 
experience for example, expect to win the 
prizes, but losers cannot have their effort re- 
funded; that ‘‘bids’’ are sunk makes the all- 
pay auction a theoretical construct which can 
be used to model this situation.* 

One drawback with the all-pay auction 
framework which limits its empirical applica- 
tion, however, is the concentration on the case 
in which a single prize is available (see 
Michael R. Baye et al., 1989, 1990; Arye L. 
Hillman and John G. Riley, 1989). In this pa- 
per, we extend the complete information ver- 
sion of the all-pay auction to allow for multiple 
(homogeneous) prizes. Our extension can be 
used at the theoretical level to examine 


* Clark: Department of Economics, NFH, University 
of Tromse, N-9037 Tromsø, Norway; Riis: Foundation for 
Research in Economics and Business Administration 
(SNF), Gaustadalléen 21, 0371 Oslo, Norway. Financial 
support from The Research Council of Norway is grate- 
fully acknowledged. We would like to thank seminar par- 
ticipants at the University of Bergen, Karl Ove Moene, 
Michael Wallerstein, and two anonymous referees for 
helpful comments. Any remaining errors are our 
responsibility. 

' George P.-Bakér et al. (1988) note the prevalence of 
horizontal equity in organizations. 

? Baker et al. (1988) indicate that promotion-based in- 
centives are common in corporate America. 

_* Fora comparison of the all-pay and more traditional 
auction forms, see Erwin Amann and Wolfgang Leininger 
(1995, 1996). 


_ Surprisingly, one result from the sin 































whether established properties of the ` 
prize all-pay auction carry over to th 
general case. 

One example is the ‘‘exclusion princi: 
of Baye et al. (1993) in which the admi 
tor of a single-prize all-pay auction’ 
wish to exclude contestants with a hi 
ation of the prize in order to increase 
revenue. We demonstrate that this res 
not necessarily hold in the several-priz tá 


and provide intuition behind this result: Ka 


case which does carry over to our more 
setting is that symmetric contestants wil 
pectationally) completely dissipate the 
of the prizes in their bidding (see Hill 
Dov Samet [1987] for the single-prize čik% 
we show, however, that the equilibrium ings 3 
case is not necessarily symmetric. 

As mentioned, one application of our 
prize all-pay auction is to worker incéaty 
We examine the type of distribution ma 
nism, simultaneous or sequential, which 
might use in order to stimulate extra ef 
the face of a uniform wage restriction: A? 
example, consider the internal labor m 
a firm which is looking to promote n W 
to the next level of its hierarchy. Thes 
motions form the prizes in an all-pay 8l 
where those who exert the greatest eff 
In distributing these promotions, the 
several options: promote n workers § 
neously, or divide up these promotions 
stigating a sequential contest. If the 
maximize the expected extra effort ¢ 
workers, we show that the perceived 
sition of the workforce determines 
these options a firm should choose b 
ing two mechanisms. First we presen 
prize all-pay auction in which all ” 
awarded simultaneously (to the ” 
with the highest effort); then this ™ 
tended to cover the case in which t 
are available sequentially (and p° 
blocks, i.e., several prizes available. 
round). We characterize the un! 


















































in each mechanism and calculate 
effort of each worker as well as 
pility distribution of the beneficia- 
os ain example taken from the model, 
Fs onstrate that a firm with a dominant 
Sho has a much lower cost of effort 
wo 
ane in the contest by distributing 
simultaneously, whereas if no worker 
y minant, sequential distribution is to be 


F= ana of using a contest to inspire extra 
; etka G lared to the literature on the 
w oF rank-order tournament as an optimal 
‘hee contract in the presence of moral hazard— 
r alia, Edward P. Lazear and Sherwin 

k (1981) and the extensions of their work 
ae a ¥esry R. Green and Nancy L. Stokey (1983) 
Š: a4 Bary J. Nalebuff and Joseph E. Stiglitz 
“A common assumption in these pa- 
at workers have identical preferences 
that worker performance is a random vari- 
‘which is related to effort, which itself is 
sacbiervable; the random components of per- 
famae are drawn from a common distri- 
$455 s0 that workers are ex ante identical and 
Ee fogus is on a symmetric Nash equilibrium 
ia fife Strategies. This assumption is reason- 
ac in their work since the tournament is set 
period (worker lifetime) framework. 
frast,, our approach emphasizes differ- 
sia ‘the participants in an environment 
“rank order of worker performance 

p tw be assessed. It is reasonable to assume that 


stants’y ' 

sipate thera 
ee Hillmaay 
le-prizeccum 


6tmation on the abilities or type of 
eagues.* Those who are unsuccessful 
Promotion contest may remain in the 
ñd ĉompete again at a later date, giving 
a chance to assess the ‘‘types’” of 

es. In contrast to the tournament 
¿which all workers compete (using 


x ? 
Squilibrium of our complete information game 
Ra as the limit of a game in which the valua- 
Players ‘are statistically independent and are 
On; as the variances of the distributions 
Valuations are drawn approach zero, the 
game of incomplete information ap- 
$ Strategy equilibrium. This is an ex- 
cation theorem” of John Harsanyi 





uld maximize expected to- ` 
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a symmetric strategy), we show that known 
differences between workers lead to an equi- 
librium in which only a subset of the workers 
are active in the contest. 
` Our results also have implications and ap- 
plications in the field of rent seeking. Ever 
since this term was coined by Anne O. 
Krueger (1974), economists have been inter- 
ested in the amount of resource waste which 
arises as a consequence of the competition for 
rents; Krueger’s example was the competition 
for import licenses. Somewhat surprisingly, 
analysis and implications have been based al- 
most exclusively on formal models involving 
a single rent. Our models extend this simple 
case. With n identical, indivisible prizes, we 
demonstrate that the amount of rent seeking 
which arises is sensitive to whether the prizes 
are awarded simultaneously or sequentially. A 
further application of our models is to ration- 
ing by waiting in line; our results extend those 
of Leif Johansen (1987) who uses an all-pay 
auction to model a queuing situation. 
The paper is organized as follows. Section 
I presents and analyzes the all-pay auction in 
which the highest n bids all win a prize, while 
Section I examines an extended model in 
which blocks of prizes may be distributed se- 
quentially. The results of the models are ap- 
‘plied in Section III to the case of worker 
incentives in the face of a nondiscriminatory 
wage policy. Section IV discusses further ap- 
plications and implications of our results in the 
areas of rent seeking and goods rationing by 


- queuing. 


I. Simultaneous Distribution 


Imagine an all-pay auction in which there 
are n = 1 identical prizes to be won. There are 
N players.who we shall rank according to the 
value, v; (i = 1, 2, ...,,N), which they place 
upon winning a prize. The players can only 


5 In an imperfectly discriminating rent-seeking contest, 
the decision rule of the contest administrator is taken to 
be stochastic so that players increase their probability of 
winning (or their share if the prize is divisible) by making 
rent-seeking outlays. S. Keith Berry (1993) and Clark and 
Riis (1996a, b) have considered this type of contest with 
several homogeneous, indivisible prizes and identical 
players. ae . 
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win one prize and, as the prizes are identical, 
v; is independent of which prize is won. To 
simplify the model assume that v, > n, > 
-- > vy.° The valuations are common knowl- 
edge. The model consists of two stages: in 
stage one, the N players simultaneously 
choose an outlay (sunk bid) x; = 0; in stage 
two, the n prizes are given to the n players with 
the highest outlays. If two or more players bid 
the same amount, we assume that the proba- 
bility of winning is equal for these players. We 
distinguish between active and passive partic- 
ipation by assuming that an active participant 
has a positive probability of bidding x > 0, 
while a passive participant will always bid 
x= 0. 

Let G; (x; ) represent the probability -that 
player i wins a prize if he contributes x; and 
all of the other players follow their respective 
equilibrium strategies. Assuming risk neutral- 
ity, the expected payoff of player i is thus 


(1) G; (xi) Vi — Xi. 


At this stage we should note an alternative in- 
terpretation of the differences between the 
players as expressed by the v; parameters. 

- Since (1) can be reformulated as v;[G;(x;) — 
(x;/v;)], 1/v; can be seen as the marginal cost 
to player i of making a bid; thus we can either 
think of player 1 as having the highest valua- 
tion of a prize or the lowest marginal cost of 
making a bid. 

The equilibrium of this game is necessarily 
in mixed strategies.’ Let F;(x) represent the 
cumulative density function of player i’s equi- 
librium mixed strategy; denote the upper sup- 
port of F;(x) by v/ and the lower support by 
vi. Proposition 1 characterizes the unique 
mixed strategy equilibrium for this game. 


é If two or more players have an equal valuation, this 
leads to multiple equilibria. For the case of a single prize, 
see Hillman and Samet (1987), Baye et al. (1990), and 
Tore Ellingsen (1991). : a 

7 Notice that a zero bid is considered as participation; 
thus simply being in the set of contestants is to be inter- 
preted as being a (passive) participant in the all-pay auc- 
tion. This is a technical assumption only; in equilibrium, 
the probability of winning with a zero bid is zero. 

® This is proved in the Appendix. 
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PROPOSITION 1: There exists à 
mixed strategy equilibrium of the 
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We see from (3) and (4) that the 
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the distribution is used. 


We next derive the probability for each 
siyer of winning a prize in the unique equi- 
f:-teeigm in Proposition 1. From (3) it is 
B. ssichiforward to show that the conditional 
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ae symmetric, viz. 
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čan use this symmetry to calculate the 


that a particular player wins a 


Define q; as the probability that player 


in a prize conditional upon x; = 


»2,..,0+ 1. 
first player 1. The probability that 
oses, given that all players j = 1, 


choose x; = x|, is equal to 1/ 
nce the conditional probability 


ate identical for each player. Hence 
to in + 1). If we go one step 
‘find qi = (v¥2/v,)"q}, since the prob- 


F the Usual case of n = 1. Players-l-and 2% - 


ively; 1 submits a Strictly positive bid, 
Probability v,/v, of making a strictly pos- 
=.1 in (3), as the players both choose 
gle interval, the equilibrium strategy is 
Interval [0, v.] by use of the uniform 
x/v2 (player 2 uses this distribution 

tting a strictly positive bid). These 
to those in Hillman and Riley (1989). 


a zero bid.” According to Proposition ~~ 
canifibrium is characterized by n por- 
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shy? v! > Vi+ 
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. : vi] and [vi, Vasil- Player n 
is from the entire interval [0, v, + ı ] (i.e., 
tions) in equilibrium, as does player 
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ability that all of the other players choose x; = 
vi , given that they choose x; = v}, is equal to 
(v2/v,)"."° Next, q} = (v/v) alv)" q, 
since v$ is player 2’s lower support, thus 
player two chooses x, > v| with probability 
v/v, while the remaining players choose 
x; > v\ with probability v3/v,. . 

Consider next player i. The probability that 
player i loses given that all other players 
choose x; = v! can be written l 


qi = (qi +qb + = +q)-1)0 


: : 1 
+ 1— Poca eh caso wes Ss a an 
(1 -qi= 42 q Deoa 
=(1-— bo ak aes eet T 
qi q2 qi-} are a. 


Taking into account the symmetry of the con- 
ditional distribution functions it follows that, 
fori <s, 


3 v, n+t2=s s-l yj : 
qi = (2) I (2)a 


Hence player i’s probability of winning a prize 
when the n highest bidders are all successful, 
p;(n); can be written 


(5) pi(n) = 1 —qi*? 


n+] v, f 
sief (2)a. 


ge MV 


Given the probability of each player win- 
ning a prize, it is straightforward to calculate 
the equilibrium expected bids. From Proposi- 
tion 1, we have that the net surplus of player 
iis v; — V,+13; however, the gross surplus is 
pi (n)v,. The difference between the gross and 
net surplus is the expected bid, x; (n). Thus 


(6) x(n) = pi(n)v; — (Vi — Ynei): 


10 To see this note that the probability that player j 
chooses x; = v{, given that he chooses x; = v3, is 1 — 
Fu lx = v4) = nn. 
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The total of the expected bids is simply the 
sum of the x;(n) in (6) for all n + 1 active 
players. 

From the single-prize all-pay auction of 
Hillman and Samet (1987), we should expect 
to approach full rent dissipation when the 
players approach symmetry in their valuations; 
this means simply that the total value of the 
expected bids approaches the total value of the 
prizes on offer. This is indeed true in our 
model. As the valuations approach a common 
v, the players use the same mixed strategy in 
(2) and (3). Thus p;(n) ~ n/(n + 1) and 
x;(n) 7 nv/(n + 1), so that the sum of bids 
is expected to be nv, which is the total value 
of the prizes. 

One result which does not carry over so 
easily from the single-prize case is the ‘‘ex 
clusion principle’? of Baye et al. (1993). 
When n = 1 we find x,(1) = v,/2 and x.(1) = 
v3/2v, so that the total of the expected bids is 
always decreasing in v,; this forms the intui- 
tion behind the exclusion principle in which 
contest revenue can, in some circumstances, 
be maximized by excluding the player(s) with 
the highest valuation from the all-pay auction. 
In the multi-prize framework this is not nec- 
essarily the case. As a demonstration, consider 
the case n = 2 for which we find win 
probabilities 
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3v? 
V3 v3 
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with corresponding individual expected bids: 
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Notice, however, that z(2) may incte 
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Generally, as V,+1 > 0, the upper 
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(10) Gir! (x )v 


+ (1- GI"! (x))6(v, -v") -x 
= GI! (x) [(1 5)v; + ôv] 
=x, + (vi — v’) 


=GP! (Xj) Up) — xX; + ôv; — v”). 


Clearly, the two-stage game comprising stages 
r — 1 and r is equivalent to a one-stage game 
inserting u; -1 for v; . Observe that the ranking 
of the players is the same in the u-series as in 
the v-series. Hence, using Proposition 1, 
among the set of players who have not previ- 

; ously won, the n,_, + 1 players with the high- 
est valuations participate actively in stage r — 
1 and obtain an expected payoff equal to 
[(1 — 6); + 6v"] — [C1 — jv"! + yu] + 
6(y, — v) = (1 — 6)(y% — v" t) + ôf - 
v”), where v’~ ' is the valuation of the lowest- 
ranked active player in stage r — 1. Consider 
next stage t. By backward induction it follows 
that the payoff function of player i who is ac- 
tive in stage t = 1, .., r — 2 is, 


G1) Gi(%) — 4); + 60°] 


— x, + 6(v; -5') 


= Gi (x) uj, — x; + (v: — D*) 


r—2 
where Y'= (1 — 6) $ 6%")! 
A=? 


+ prs iy", 


It follows from Proposition 1 that the expected 
payoff of a player participating aċtively in 
stage t is (1 — 6)(y; — D') + (v; — v’). 
From this discussion and Proposition 1, the 
following is straightforward to show: 
PROPOSITION 2: There exists a unique 
mixed strategy equilibrium of the extended all- 
pay auction which is characterized as follows. 
At each stage t = 1, 2, ..., r the n, + 1 re- 
maining players with the highest valuations all 
actively participate. The loser of each stage 


a 
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t= 1, ...,r— 1 goes on to the next round to 
compete against the n,, highest ranked of the 
players who have not yet actively participated. 
The equilibrium mixed strategies are as given 
in Proposition 1, replacing v; by u = (1 — 
6)v; + 6v'. Considered from stage 1, the ex- 
pected net surplus of actively participating 
player i = 1, 2.,n + Lis O(I — 6) 
(vi ~ 0) + (v; — v’)], where t(i) denotes 
the first stage at which player i actively 
Participates. 


Notice that u;, is simply a weighed average 
(with 6 as the weight) of a player’s own val- 
uation and 7‘ which is common to all players. 
In order to compare this extended framework 
with that of Section I, it is useful if the n prizes 
have the same value in both models. Thus we 
concentrate on the limiting case in which 6 > 
1; in this case, u;, is the same for all players 
in stages t = 1, ..., r. From Propositions 1 and 
2 we find that at each stage t = 1, ..., r — 1, 
there are n, + 1 active participants who all 
have a strictly positive bid; the equilibrium 
bids in stages 1, ..., r — 1 are taken from the 
common probability distribution 





(12) Fix) =1-(1- £ F 


Vr+l 


over the interval [0, v,,.,]. Thus the mixed 
strategy used at any stage 1, ..., r — 1 is de- 
pendent upon the number of prizes awarded at 
each stage only, independent of the individual 
valuations. At stage r, there are n, + 1 active 
participants, one of whom randomizes be- 
tween submitting a zero and a positive bid. 
The total expected level of bids in the first 
r — 1 stages turns out to be independent of 
how the prizes are distributed across these 
stages as well as of the number of stages, i.e., 
independent of {n,}, t= 1,..., r — 1. To see 
this, consider the expected bid by player i who 
actively participates in stage ¢ = 1, ...,r—1: 


ere oo PATA 
a= BERGA Matt: 


0 n, 


Since the number of active players is n, + 1, 
the total of the expected bids at stage t = 1, 




























.,r— 1 is nv,+1; summing over 
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awarded over the first r — 1 stages, A na 
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prior to r is @, where x 
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Thus a player’s probability of losing 


final round depends upon the round in wet’ Š , 
the player entered; this in turns depends ipa $ 
the number of prizes distributed at each sz: | 
Hence the total probability that player 
first participates actively in stage t =; 
r — 1 wins a prize (1 — (1 — p:(ħ 
pends upon how the awards are distributen.® 
a consequence, this distribution also affecti f 
total of the expected bids for the whole 08 
with n prizes. As in the model of Section É 
net surplus of player i = 1, ..., n + 1 
limiting equilibrium approaches v; 7;"% 
This means that, as we approach symm 
the players’ valuations, the expected vam. 
total bids is equal to the total value; 

prizes. i 
One special case of the general model 
is simple to analyze in the limiting equ! 
(6 > 1) involves one prize being aval 
each of r rounds, n, = 1 Vt = 1, =x 
cording to Proposition 2, at stage tZ 
r — 1, the two remaining players with UX 
est valuations compete actively ag 
other with bids drawn from the 
tribution over [0, v,.,]. The final 
identical to the usual one-prize C#: 
player n + 1 competing against th 
stage r — 1. Consider player i’s prob 
winning. Since in all stages 1, + £ 
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fis an interesting difference between 
smultaneous and sequential models as the 
ga": Valuations approach a common value. 
Wis ease, the model of Section I with si- 
sus distribution of all n prizes leads to 
eximately symmetric equilibrium in 
otal value of the prizes is com- 
pated. When one prize is distrib- 
tially over r rounds, we achieve the 
‘amount of the expected bids but we 
â sihe equilibrium in the limit is not 
ince P, > 1 — (1/2)", P> 1 — 
ytor i = 2, ..., n + 1. This is due 
that the rank of a player determines 
al number of times that the player 
Hen prizes are distributed sequen- 
‘Means that the character of the 


as used, even if the majority of 
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awarded simultaneously. 
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chy, and wishes to inspire extra effort.'? One 
way of doing this would be to instigate a 
winners-take-all contest of the type discussed 
in the preceding sections. A variety of prizes, 
such as paid leave or promotion to the next 
level of the hierarchy, may be used. The firm 
would require some ordinal standard by which 
to measure worker performance and make a 
ranking on the basis of which prizes are 
awarded. Suppose that the workers believe, 
based on past experience, that there is a posi- 
tive relationship between extra effort (x) and 
the identity of the winners; our models can 
then be used to calculate the expected amount 
of extra effort and the likely beneficiaries un- 
der different distributional rules. 

Recall that the v; parameters can be inter- 
preted as expressing differences in the cost of 
effort between workers, so that the higher the 
v; the lower the cost of effort to worker i; these 
parameters could be seen as expressing the 
abilities of the workers. There is no a priori 
reason to assume that a firm owner or manager 
believes that the workforce is homogeneous. 
Generally, the manager will have a belief or 
perception about the composition of the work- 
force (the v; ); on the basis of these beliefs, the 
manager can work out the expected amount of 
extra effort which will ensue in different con- 
test forms. To illustrate this, consider the sim- 
plest possible case in which two identical 
prizes are available which can be distributed 
simultaneously or sequentially, one after the 


- other. Given her beliefs about the v;, how is 


the manager to distribute these prizes to 
achieve the highest level of extra effort? Si- 
multaneous distribution of the two prizes 
yields the effort levels and win probabilities in 
(7)—(9). If the prizes are awarded sequen- 
tially, one after the other we find (using capital 
letters to distinguish from simultaneous 
distribution ) 


12 We assume that this extra effort is valuable to the 
firm without explicitly modelling how this affects output 
or profit. In contrast, ‘‘effort’’ in the applications men- 
tioned in Section IV may be regarded as socially wasteful 
and thus the goal would be to minimize this activity. 
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for individual effort levels, and 
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as the expected total amount of effort. 

Z(1, 1) increases in v3; however, Z(1, 1) is 
decreasing in v, and v7, via the effect which 
these variables have on the bid of player 3." 
Player 3 competes only for the second prize, 
facing player 1 or 2, whoever is the loser of 
the first contest. The probability that player 3 
gives a Strictly positive bid is v3/v’ where v’ 
is the valuation of the loser of contest one; thus 
this probability is decreasing in v, and n2. If 
player 3 bids x, > 0 then he wins the second 
contest with probability 0.5; the probability of 
bidding x, > 0 is v/v, if player 1 is the op- 
ponent (probability 0.5 of this occurring) and 
v3/V, if player 2 is the opponent (probability 
0.5). Multiplying these terms yields P;(1, 1). 

The amount of effort exerted by players .1 
and 2 is dependent only upon the upper sup- 
port of the probability distribution function 
which they use in equilibrium. Players 1 and 
2 compete over the first prize, each exerting 


__ " Thus the logic behind the exclusion principle is valid 
in this case. 
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player who loses the first. Thus the‘é 
amount of effort by players 1 and 2 
-~ ~- Now-consider which of the two con 
multaneous or sequential, yields th 
expected effort in total. Note that , 
three workers with the lowest marginal c z 
of effort (highest v; ) will be active 
contest form. Consider what happe 
v, > œ so that v, is very large in rela 
and v3; this may be interpreted as’ 
having a much lower cost of effort 
other two. Comparing (9) and (17)‘in 
that z(2) > Z(1, 1) in this case so; 
multaneous distribution yields the gréites 
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z(2) so that sequential distribution 18"; 
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effort level. Which type of conte 
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id as (due to a higher relative effort 
effort: in the simultaneous contest; this is be- 
(17) i this contest affords player 3 a chance of 
ase so g:two contestants whereas this player : 


tes only once and against only one other 
gantender in the sequential contest. Worker 2, 
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When Ang the sequential contest as he competes 
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gach time. For worker 1, we find that 
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A. Rent Seeking 
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t that a decision maker often has a 


Sethe Contrast to the tournament literature dis- 
‘or po , i ae +: Where agents are identical ex ante and all 
in the symmetric Nash equilibrium. 
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sions, permits, or transfers, for example, often 
gives rise to activity and resource use which 
is explicitly intended to influence the decision 
[see Gary S. Becker (1983, 1985)]. Such 
‘‘rent-seeking’’ activity has been the subject 
of a large literature (see Shmuel Nitzan [1994]. . 
for a survey ). Since those who fail to influence 
the decision maker do not have their ‘‘bids’’ 
refunded, the all-pay auction has often been 
used to model rent-seeking activity (see 
Hillman and Samet [1987]; Hillman and Riley 
[1989]; Ellingsen [1991]). By explicitly mod- 


. elling the competition for several rents, we 


have extended traditional rent-seeking theory 
which typically concentrates on the case of a ` 
single rent. We have shown that the charac- 
teristics of the equilibrium, the amount of rent 
seeking, and the probability distribution of the 
beneficiaries are sensitive to whether the rents 
are made available simultaneously or in some 
sequential manner. 

In the rent-seeking tradition, our models most 
resemble those of Mark Gradstein and Nitzan 
(1989), who examine a situation in which iden- 
tical players compete for a number of indivisible 
rents; each player must decide for which 
prize(s) to compete and how much to bid for 
each prize. With homogeneous prizes, their fo- 
cus is on symmetric Nash equilibria. In contrast, 
our approach presents an all-pay auction with 
complete information in which competitors with 
asymmetric valuations bid for several homoge- 

. neous indivisible prizes; thus we do not restrict 
attention to symmetric equilibria. 

Nitzan (1994) indicates that one of the major 
challenges facing rent-seeking theory is to en- 
dogenize the contest form; several writers have 
discussed the possibility of the contest admin- 
istrator designing a contest in order to achieve 
the maximum contest revenue. In the context 
of the single-prize all-pay auction, Ian Gale and 
Mark Stegeman (1994) suggest the use of 
handicaps, while Baye et al. (1993) propose 
excluding some of the players. Elie Appelbaum 
-and Eliakim Katz (1987) look at-the optimal -. 
value of the prize in an imperfectly discrimi- 
nating contest, while Robert Michaels (1988) 
and Amihai Glazer (1993) consider endogeni- 
zation of other parameters in these models. Oth- 
ers have considered the possibility that an 
outsider (i.e., not the contest administrator ) can 
influence the form of the contest. Society may 
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be interested in limiting the amount of waste 
from rent seeking, for example. For an imper- 
fectly discriminating contest, Avinash Dixit 
(1987) suggests that a method of achieving this 
is to dictate the order of play; Roger D. 
Congleton (1984). indicates that the decision- 
making authority in a rent-seeking contest 
should be delegated to a committee rather than 
to a single individual. It is important to note 
that rent seeking is not just a phenomenon 
which arises as a consequence of governmental 
or political decision-making. Paul R. Milgrom 
(1988) discusses rent seeking, or influence ac- 
tivities, in organizations; rent-seeking activity 
may well arise in firms with a fixed hierarchical 
structure as workers at each level compete 
among themselves for a limited number of pro- 
motions to the next level. In contrast to the ef- 
fort contest in Section II as a means of securing 
promotion, the rent-seeking view suggests that 
workers spend time on nonproductive activity 
in the quest for promotion. 

Our two models represent a reasonable way 
in which the contest form can be endogenized. 
By deciding how the prizes are to be distrib- 
uted, sequentially or simultaneously, an admin- 
istrator may be able to maximize contest 
revenue. Alternatively, an outsider may be able 
to limit the amount of rent seeking or increase 
the probability that certain players are the likely 
beneficiaries by choosing how the administrator 
is to award the prizes. It is important to note 
that we are not suggesting that a contest ad- 
ministrator (or outsider) will explicitly choose 
an all-pay auction as a distribution mechanism; 
rather, we feel that the all-pay auction is a use- 
ful modelling device when a decision process 
involves discretion so that the final decision is 
open to influence. Given that a process which 
resembles an all-pay auction arises, our models 
can be used to assess the effects of awarding 
all of the prizes simultaneously and awarding 
blocks of prizes sequentially. l 


B. Rationing by Waiting 
Johansen (1987)'° used the all-pay auction 


under complete information to analyze a situ- 


'® This article was actually written in 1982, shortly be- 
fore Johansen’s death; thus, his work on the all-pay auc- 


ation in which a good could be 
waiting; bids are interpreted as 
that a high bid means arriving early iras 
secure the good. His analysis cog 
mainly upon the mixed strategy eigi 
which arose when a ‘single good was: ; 
although he did also begin to consi 
in which symmetric agents compete 
homogeneous, indivisible units of 
However, he did not give a full charac: 
of the equilibrium except for some: 
cases, conjecturing that the general cags 
need to be solved using numerical ana} 
models which we present and solve ¢; 
depict exactly this general case. 
Charles A. Holt, Jr. and Roger, $ 
(1982) consider a waiting-line auct 
in which valuations are private: inf 
and there is an entry fee (defined as; 
of travelling to the queue). The formal mig? 
which they use is similar to our simul 
model of Section I, in which each ¢@ 


may win no more than one unit of the rating’ 


good; since consumers are ex ante idé 
the focus in their paper is on symmetrie {Ẹ 
strategy) equilibrium. Our model, in c% 
is one of complete information in whith! 


personal differences in valuations are [ipe 


tant for equilibrium behavior. Our 


those of Holt and, Sherman (1982) could $ 


seen as steps toward solving the general 
of the waiting-line auction in which val 
are private information, but consumers 
ex ante identical.” 


APPENDIX 


PROOF OF PROPOSITION 1: 
Denote by m the number of players 
x > 0 with a positive probability (1+ 
participants). Observe first that m-18 @.* 
n + 1, otherwise the optimal outlays’ 
(where s > 0 is arbitrarily small), S% 


already referred. 


17 Amann and Leininger (1996) consider a Ser | 
ger ( ) EA 


all-pay auction in which two individuals C0% 
prove the existence and uniqueness of a Bayesi 
rium when the valuations are private info 
are drawn from different probability distribu’ 

























































MARCH . 
e Ta ; h actively participating players re- 
ie : prize with certainty. Let us first show 
nly n F te <Sqayilibrium distribution functions are 
conceg gs above Zero. Assume the contrary 
y equine : a lay yer i chooses x’ > 0 with a strictly 


robability and hence i’s distribu- 
on is discontinuous. Observe that 
0 otherwise i would lose with cer- 


fancti 


ete fi 

of x'isnota best response. Consider 
harac ej+i It follows that Gj (x’) is 
som ter than G;(x') where Gt (x') = 
sral Case ‘Wwe G(x)Ix > x’ from above. Hence, 
al analy 
solve ë G(x) Uji Xj 
toger Shen Lontinuous and jumps upwards at x’. 
» auction:med ingly, 
ate informais : 

ned as —x' > G(x! — 6)v;— (x' ~ 6) 
e fo 
ar sim 6 in a sufficiently small interval 6 € 
each Hence no player j + i bids in the 
t of the jal [x' — 7, x'], which implies that 
c ante id f= r) = G;(x’). Since player i may choose 
symmetrie tg td strictly lower than x’ with no reduction in 
ydel, in probability of winning, x’ is not a best reply. 
n in whi Thas-the “equilibrium distribution function of 
tions afe! i must be continuous above zero. 

Our r der the upper support of the equilib- 
(1982) stribution functions, and denote by 


. We will now show that v7 = 
at least n + 1 players. Assume on the 
fy.that v; = v“ for strictly less than n + 
Bayes and denote by v”? the second-highest 

rt. Since the equilibrium distribu- 
tions are continuous above zero, it 


1: Support. v" may reduce his expected bid 
of players ucing the probability of winning. 
bility ( e equilibrium strategies are mixed, 


at v; > v! for all actively partic- 
Players. Consider player i. Then 


equality i 1s replaced by equality if 
aking into account that G; (v“) = 
ance the distribution functions are 
shove zero) it follows that, 
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where the latter weak inequality is replaced with 


-a strong inequality if v“ < v;. Assume that the 


number of actively participating players m is 
strictly greater than n + 1. Since v;_, > v; and 
the valuation of the lowest-ranked active player 
is at least v“, it follows that G; (vi) > 0 for: 
strictly more than n players. Since the equilib- 
rium distribution functions are continuous above 
zero this implies that at least two active players 
bid vi = 0 with a strictly positive probability, 
and not more than n players bid x > 0 with 
certainty, thus G; (0) > 0. However, this implies 
that the winning probability is discontinuous at 
zero, G7 (0) > G;(0), which is incompatible 
with an equilibrium (any player is better off bid- 
ding an arbitrarily small amount, than bidding 
zero). Hence m is at most n + 1. As we have 
already shown that m is at least n + 1 it follows 
that exactly n + 1 players participate actively in 
equilibrium. Furthermore, as G;(vj) = 1 — 
y“/v; and v; = v“ and G;(v}) = 0 for at least 
one player, it follows that v“ is equal to the val- 
uation of the lowest-ranked active player. Hence 
if there exists a nonactive player with a higher 
valuation than the lowest-ranked active player, © 
the former can bid v” with certainty and make a 
strictly positive profit. Thus it is the n + 1 
highest-ranked players who participate actively, 
v“ = v,,,. Hence, setting x; = v“ = v,,, and 
G;(v") = 1 in payoff equation (1), we see that 
the net surplus of player i is v; — Va+ı. 

To show that the equilibrium distribution 
functions are strictly increasing, assume that, 
on the contrary, there exists an interval, say 
(vj, v; ), at which the probability density of 
player i’s distribution function is zero. Since 
the number of active players is n + 1, it fol- 
lows that G,(v/) = G,(v;’) for all j + i. Ac- 
cordingly, the density of any player is zero in 
this interval. However, this implies that 


Givi )u;— vi > Gui v- 47; 


thus, v7;’ is not an optimal reply and hence the 
equilibrium eibanon functions are aed 
increasing: -- >s as 

Since the equilibrium distribution NT 
are strictly increasing, we know that 


Gi(x)v; — x 


is independent of x Vx € [vj, vi]. 
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Since vi = v4, Vi, hence G;(¥_41) = 1 Vi, 
it follows that the unique function for the prob- 
ability that 7 wins is 

G; ( x)=1- Vat) + = 


vi vi 


(A1) 
which implies that 
G(x) > Gei (x) V K= 1,..., n; 


hence, 


l 


l l 
Vagy > V> V> w EE 


>v =v 


where the latter weak inequality is due to the 
fact that G; (0) = 0, (that is, only one of the 


n + 1 active players may play an atom at 
zero). 


Consider the interval [v], v,.,]. Since all 
players bid with positive probability in this in- 
terval, player j’s probability of winning, 
G;(x), can be written 


G(x) =1-JJ (1 - F;(x)). 


i+j 


(A2) 


From (A1) we obtain the probability that j 
wins (by a change of subscript), which when 
inserted into (A2) gives 


Vrsi X 
er ESF Gy) 

Vi i+j 
This gives n + 1 equations which determine a 
unique solution for F; (x), i = 1,2, ..., n + l, 
over the interval [vi > Va+1]. The equilibrium 
distribution functions over the intervals [v!, 
vi-1), i = 2, 3, .., n + 1 are determined 
recursively. 
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Intellectual Human Capital and the Birth 
of U.S. Biotechnology Enterprises 


By LYNNE G. ZUCKER, MICHAEL R. DARBY, AND MARILYNN B. BREWER’ 


The number of American firms actively using 
biotechnology grew rapidly from nonexistent to 
over 700 in less than two decades, transforming 

“the nature of the pharmaceutical industry and 
significantly impacting food processing, brew- 
ing, and agriculture, as well as other industries. 
Here we demonstrate empirically that the com- 
mercialization of this technology is essentially 
intertwined with the development of the under- 
lying science in a way which illustrates the sig- 
nificance in practice of the localized spillovers 
concept in the agglomeration literature and of 
the tacit knowledge concept in the information 
literature. Indeed we present here strong evi- 
dence that the timing and location of initial usage 
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Torero. Armstrong was principally responsible for con- 
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by both new dedicated biotechnolog, 
( ‘‘entrants’’) and new biotech suby 
existing firms ( ‘‘incumbents’’) are 
explained by the presence at a parti 
and place of scientists who are active] 
uting to the basic science as representa i 
publications reporting genetic-sequencg 
eries in academic journals. 

By quantifying separable effects of 
ual scientists, major universities, and fa 
research support we provide specific § 
to the role of universities and their facullig 
encouraging local economic devel 
through what are conventionally describyf 
the literature as geographically loéalli 
knowledge spillovers.’ Such localized 
edge spillovers may play fundamental t 
both economic agglomeration and endogé 
growth (Paul M. Romer, 1986, 1990; Géae 
Grossman and Elhanan Helpman, 
However, our evidence, like the other. 
ture cited here, specifically indicates | 
effects without demonstrating that the 
characterized as spillovers (or externalites: 

Section I lays out our basic hypothesis 1%; 
data are described in Section II. Empin@ 


ee, 


— 


A summary and conclusions section (Sä 
IV) and Data Appendix complete the 


I. The Hypothesis — 


growth literature as a nonrivalrous g 
useable by an unlimited number of 


' Zvi Griliches (1992) has surveyed the i 
R&D spillovers as a major source of endogea™ 


culties, there have been a number of articles 1% 
idence of geographic localization of knowlege 4 
including Adam B. Jaffe (1989), Jaffe et al 

Edwin Mansfield (1995). 


















































zero marginal cost (Richard R. Nelson 
1996). A complementary literature 
that some information requires an in- 
+ of considerable time and effort to mas- 
; fuman capital developed by this 
“tit ig seen as earning a normal return on 
#of the investment, both direct costs and~ 
earings. We believe that some inno- 


hnolo icularly a breakthrough ‘‘invention 
th sul Zahod of inventing” (Griliches, 1957), 
) are “petter characterized as creating (rival- 
parti man capital — intellectual human 
Ctivel ¿characterized by natural excludability 
repre to a set of instructions for combining 


d outputs which can be protected only 
Jectual property rights. This natural ex- 
“try arises from the complexity or tacit- 
the information required to practice the 
on (see Nelson [1959], Kenneth J. 
1962], Nelson and Sidney G. Winter 
and Nathan Rosenberg [1982]). 

on both extensive interviews and em- 
ork summarized in Zucker and Darby 
we believe that, at least for the first 
§ years, the innovations which underlie 
ology are properly analyzed in terms 
ally excludable knowledge held by a 


es, and {rama 
g Bay i" 
ecific strig 


ipman, sall Initial group of discoverers, their co- 
the othe grs and others who learned the knowl- 
dicates loc € from working at the bench-science level 
that they. ati those possessing the requisite know-how. 
r extern imately the knowledge spread sufficiently 
hypoth ‘Sidely'to become part of routine science 
a Emp d be learned at any major research 
ed in 


, After the initial 1973 discovery by 
ohen and Herbert Boyer of the basic 
juéfor recombinant DNA—the foun- 
k of commercial biotechnology as well as 
sis; / K Of scientific innovation—the finan- 
; ms available to talented recombinant- 


y reae : entists first rose dramatically as the 
OUS BOO Pee cet al implications became widely ap- 


$- iaia and then more gradually declined as 
* more scientists learned the tech- 

til knowledge of the new techniques 
ned oniy the’ normal return for the 
d for a graduate student to master 
er, mere knowledge of the tech- 
mbinant DNA was not enough 
xtraordinary returns; the knowl- 
More productive when embodied 
with the genius and vision to con- 
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tinuously innovate and define the research 
frontier and apply the new research techniques 
in the most promising areas. 

We hypothesize that entry of firms into bio- 
technology in a given year thus will be deter- 
mined by the geographic distribution of: stars 


~and perhaps others then actively practicing the 


new science as well as by the geographic dis- 
tribution of economic activity. Stars are prop- 
erly viewed as locationally (semi-) fixed since 
few star scientists who knew how to do recom- 
binant DNA were willing to abandon their uni- 
versity appointments and laboratory teams to 
pursue commercial applications of biotechnol- 
ogy. The primary pattern in the development 
of the industry involved one or more scientist- 
entrepreneurs who remained on the faculty 
while establishing a business on the side— 
businesses which, where successful, resulted 
in millions or even billions of dollars for the 
professors who acquired early ownership 
stakes. Thus, we see the university as bringing 
about local industrial benefits by permitting its 
professors to pursue private commercial inter- 
ests while their faculty appointments tie them 
to the area. In preliminary work not reported 
here, we tried to develop measures of local 
economic activity for industries, like pharma- 
ceuticals, specifically impacted by the new 
technology, but these attempts never added 
significantly to the measures of general activ- 
ity used in the empirical work below. The lo- 
cal availability of venture capital is widely 
believed to play a significant role in the birth 
of new biotech entrants (Martin Kenney, 
1986; Joshua Lerner, 1994, 1995); so we also 
include that variable in our regressions. 


If. The Data 


Data has been collected in panel form for 14 
years (1976-1989) and 183 regions (func- 
tional economic areas as defined by the 
U.S. Department of Commerce, Bureau of 
Economic Analysis [BEA], 1992b). Fre- 
quently, the data are aggregates of dafa at the 
zip code or county level.” Lagged variables 


2 The BEA’s functional economic areas divide all the 
counties in the United States into regions including one or 
more cities, their suburbs, and the rural counties most 
closely tied to the central city. 
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include data for 1975 in the unlagged form. 
See the Data Appendix for more details. 


A. Firms 


Our data set on firms was derived from a 
base purchased from the North Carolina Bio- 
technology Center (NCBC) (1992) which 
was Cleaned and supplemented with informa- 
tion in Bioscan (1989—1993 ) and its precursor 
(Cetus Corp., 1988). We identified 751 dis- 
tinct U.S. firms for which we could determine 
a zip code and a date of initial use of biotech- 
nology. Of these 751 firms, 511 were entrants, 
150 incumbents, and 90 (including 18 joint 
ventures) could not be definitively classified. 
By 1990, 52 of the 751 firms had died or 
merged into other firms. 

We then calculated the number of births in 
each region by year of initial use of biotech- 
nology for all 751 firms as well as for their 
identified subcomponents of entrants and in- 
cumbents. We also have the stocks of surviv- 
ing firms, entrants, and incumbents by region 
and year. 


B.. Scientists 


Early in our ongoing project studying the 
scientific development and diffusion of bio- 
technology, we identified a set of 327 star sci- 
entists based on their outstanding productivity 
through early 1990. The primary criterion for 
selections was the discovery of more than 40 
genetic sequences as reported in GenBank 
(1990) through April 1990.* However, 22 sci- 
entists were included based on writing 20 or 
more articles, each reporting one or more 
genetic-sequence discoveries.* In the 1990’s, 


3 See Zucker et al. (1993). As will be obvious, much 
of the time between 1990 and the initial submission of this 
paper was spent in developing reasonable measures of in- 
tellectual human capital and in collecting and coding data 
necessary to locate the authors of the discoveries-reported 
in the articles in question and to trace the diffusion 
process. ; 

* Scientists advised that some sequence discoveries are 
more difficult than others and thus merit an article report- 
ing only one sequence. Therefore we included scientists 
with 20 or more discovery articles to avoid excluding sci- 
entists who specialized in more difficult problems. 


` both concentrated and highly core 



































sequence discovery has become roy 
is no longer such a useful measure 
success. These 327 stars were ¢- 
quarters of one percent of th 
GenBank (1990) but accounted fg; 
cent of the published articles, alm, 
as many articles as the average scjgs 
We collected by hand the 4,06] à 
thored by stars and listed in GenBåä 
corded the institutional affiliation 
and their coauthors on each of th 
These coauthors are called ‘‘collaboys 
they are not themselves a star. Somg 
the stars and collaborators who ever'p 
in the United States is given on the Je 
Table 1, where the scientists are idej 
the organization(s) with which they: 
filiated on their first-such publicați 
higher citation rate for firm-affiliated: 
is explored at length in Zucker. and Reem 
(1996). ce 
Figure 1 illustrates the time patègi 
growth in the numbers of stars and 
rators who have ever published and 
number of firms using biotechnology:é 
United States. There was a handful of sag 
who published articles reporting gies 
sequence discoveries before the 1973 Wt 
through, but even after 1973 their 
increased gradually until taking off ia 
The numbers of collaborators and firms 
behind the growth in stars by some 
To identify those scientists clearly. 
at the edge of the science in a given yee 
term a star or collaborator as ‘‘activé 
or she has published-three or more 
discovery articles in the three-yeaf: 
window ending with that year. As seed 
right side of Table 1, this stringent 
screen provides an even more elite 
of star scientists as well as iden ; : 
very significant collaborators. We @ aes NA whi 
each year the number of active stars and een 20 Comm 
collaborators who are affiliated with #73 petica readers 
nization in each region. j 
The locations of active stars and 


tines: 


graphically, particularly early in $ 
Figure 2 illustrates this pattern for 
period by accumulating the num 
who have ever been active in each 
to 1990 and plotting them togeth 
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TABLE 1—DISTRIBUTION OF STAR SCIENTISTS AND COLLABORATORS 
Wuo Have EVER PUBLISHED IN THE UNITED STATES 
Full data set Ever active in U.S.° 
ae _Numberof ------ _ Citations‘/ Number of Citations‘/ 
ston type scientists scientist/years scientists scientist/years 
158 85.5 108 110.8 
44 63.0 26 98.7 
5 143.7 1 694.3 
0 ma 0 n/a 
207 135 
2901 10.4 369 30.6 
716 13.7 88 35.8 
324 29.2 43 99.1 
3 7.2 0 n/a 
4004 500 














. èa United States. 


b- meavidaal was listed as an author. 


aon which we have laboriously col- 
e found two measures of regional 

© which entered separately in re- 
feported below, but none which 
le of eliminating the effects of the 


. 


3 @rganization type refers to the affiliation listed on their first publication with a U.S. affiliation. 

Ever active in the U.S. means that in at least one three-year period beginning 1974 or later and ending 1989 or earlier, 
aust was listed on at least three articles appearing in our data set of 4,061 articles which reported genetic-sequence 
and were published in major journals and that the affiliation listed in the last of the three articles was located 


Citation counts are for 1982, 1987, and 1992 for all articles in our data set (whenever published) for which the 


One measure is a count of the number of 
‘top-quality universities’’ in a region where 
top quality is defined by having one or more 
“‘biotech-relevant’’ (biochemistry, cellular/ 
molecular biology, and microbiology ) depart- 
ments with scholarly quality reputational rat- 
ings of 4.0 or higher in the 1982 National 
Research Council survey (Lyle V. Jones et al., 
1982). There are 20 such universities in the 
United States. Our second measure, ‘‘federal 


-wiy te 


$ The 20 universities were: Brandeis University, Cali- 
fornia Institute of Technology, Columbia University, Cor- 
nell University, Duke University, Harvard University, 
Johns Hopkins University, Massachusetts Institute of 
Technology, Rockefeller University, Stanford Univer- 
sity, University of California-Berkeley, University of 
California-Los Angeles, University of California-San Di- 
ego, University of California-San Francisco, University of 
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—æ--20 Collaborators 
es 


#4 New Biotech Enterpris 





Cumulative Number 


support, ’’ is the total number (in hundreds) of 
faculty supported by 1979-1980 federal 
grants to all universities in each region for 
biotech-relevant research.° These variables 
take on the same value for a given region in 
each year. 


D. Other Variables 


Using listings in Stanley E. Pratt (1982), we 
measure ‘‘venture capital firms’’ as the num- 
ber of such firms in a region legally eligible to 
finance start-ups in each year up to 1981. For 
later years, the number of firms is fixed at the 





Chicago, University of Colorado at Denver, University of 
Pennsylvania, University of Washington (Seattle), Uni- 
versity of Wisconsin-Madison, and Yale University. 

6 We also tried a measure of biotech-relevant research 
expenditures as reported by the universities, but this vari- 
able was too collinear with the federal support variable to 
enter separately and appeared to be less consistently mea- 
sured across universities. i 


-in all industries (in millions of perso! 






















number in 198 to avoid possible simults 
problems once the major wave of bio 
founding began.’ (While great books 
spring up around great universities, the fora & 
should not be counted as causing the latte 

Since entry of biotech firms would 
pected to occur where there is other ect 
activity, particularly involving a highly s 
labor force, we also include total emplo 


average wages (measured by deflated a 
earnings per job in thousands of 1987 
for each region and year. 

Finally, an increase in, the (all-equity), 
of capital, as measured by the earning 
ratio on the Standard & Poor’s 5% 
would reduce the net present value 


approach to this problem if suitable instrum@ 
found. 



















wet 


g 
ild have a negative impact on birth 
8, entrants or incumbents. 


; im. Empirical Results 


e reis i hypothesis using both the full 
ms sala and by regressing the geographical 
E zbyli n of the data in 1990 on values 
„dependent variables circa 1980. The 
More fully exploit the available infor- 
le the latter avoid problems of pos- 
Itaneity which might arise after 
£8 commercial biotechnology be- 


the Tegressions reported here, as 
iar tensive sensitivity analysis noted 
ET SSümated in the poisson form ap- 
irr, COunt variables with numerous 
has TS; LIMDEP (William H. Greene, 
Pinseri o ie: 9-49 ) | with the Wooldridge 
ae ¢-correction for the variance- 


icant economic factor in some - 


295 


+ = New Biotech Enterprise 


Ficure 2. Ever Active Stars AND New BIOTECHNOLOGY ENTERPRISES AS OF 1990 


covariance matrix estimates.* The poisson 
regressions estimate the logarithm of the ex- 
pected number of firm births; so the signs and 
significance of coefficients have the usual in- 
terpretation. Although OLS regressions are in- 
appropriate for our count dependent variables 
with most observations at zero and the rest tail- 
ing off through small positive integers, we 


8 As discussed in Jerry Hausman et al. (1984), the pois- 
son process is the most appropriate statistical model for 


crated 


count data such as ours. In practice, oVerdispersion (pos- ` 


sibly due to unobserved heterogeneity) frequently occurs. 
Given the problems with resort to the negative binomial 
(A. Colin Cameron and Pravin K. Trivedi, 1990), Jeffrey 
M. Wooldridge (1991) developed a flexible and consistent 
method for correcting the poisson variance-covariance 
matrix estimates regardless of the underlying relationship 
between the mean and variance. We are indebted to 
Wooldridge and Greene for advice in implementing the 
procedure in LIMDEP. 
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reported broadly consistent results using that 
technique in an earlier version of the paper 
(Zucker et al., 1994). 

In our sensitivity analysis, we.ran the same 
poisson regressions for entrants and incum- 
bents defined both exclusive and inclusive of 
the arguable case of joint ventures. The results 
were generally very similar to the subcompo- 
Tent régréssions in Table 4. In other unre- 
ported poisson regressions, we found that 
eliminating those regions with no firms and no 
stars from the sample did not result in quali- 
tatively different results. 


A. The Long-Run Model 


Because of concerns about possible simul- 
taneity biases once the industry became a 
significant economic force, we begin our em- 
pirical discussion with models which relate the 
number of firms in each region at the begin- 
ning of 1990 to the distribution of intellectual 
human capital and other variables as of about 
1980. These results provide something of an 
acid test of our approach. 

In Table 2, we present cross-section poisson 
regressions across the 183 regions explaining 
the number of firms in each at the beginning 
of 1990 when our data set ends.? Column (a) 
restrains the analysis to only the numbers of 
stars and collaborators ever active in each re- 
gion at any time up through 1980, while col- 
umns (b) and (c) add first their squares and 
then our other intellectual human capital vari- 
ables. Column (d) considers alternatively 

_ other economic variables which might explain 
entry, and column (e) combines the variables 
in (c) and (d). Column (f) adds to this model 
the number of biotech firms existing in 1980. 

Column (a) in Table 2 indicates that the 
number of stars and collaborators active 
through 1980 is a powerful predictor of the 


°In an earlier version of this paper we included an al- 
ternative form of Table 2 (available from the authors upon 


request) in order to forestall interpretations that the results _ 


in Table 2 may reflect reverse causality. This alternative 
table reported regressions which explain the number of 
firms alive at the beginning of 1990 that were born after 
1980. Nearly identical results were obtained, reflecting the 
fact that bulk of new biotechnology enterprises were 
founded after 1980. ` 































geographic distribution of biotech eš 
in 1990, since the log-likelihood js 
—871.9 compared to —1401.7 for: 
alone. It is the star scientists that 
positively, with collaborators having: 
smaller negative coefficient in this ra, 
and most of the other long-run m, 
cussed below. We had expected, that t 
efficient on collaborators would bg’ 
smaller than that on stars, but positive 
obtain a positive coefficient on ACtIVE cat 
orators when the squared terms are add 
column (b), but that turns negative ag: 


columns of Table 2.’° (In the annual 
sions discussed below, we generally ¢ 
positive effects of active collaborator 
they are often statistically insignificant, 


erally negative sign on the number of tee’ 
collaborators in the long-run regression a 


effect on the entry of firms but reduce thé: 
fect of stars who are devoting more of Bat 
time to training students and relatively leš 

starting their own firms. Training collabovaias 
is surely a useful and rewarded ac 
particularly for the academic stars—butit 
take more of the stars’ energy than it isi 
if firm birth were the only criteria." (ii) 7. 
sign and magnitude of the coefficient.o2§ 
laborators may simply reflect signifi¢c 
ticollinearity among thé intellectual 
capital variables in the very early years. 3 
is especially likely since when we examine E g 


© In column (b) of Table 2 (and Table 3), the R= 
coefficient on the squared term indicates that gs 
ber of stars or collaborators increases, their ma] 
tribution diminishes eventually passing throu 
collaborators, in columns (c) -— (f) of these tables i 
pattem reverses so that the partial derivative 
probability of birth with respect to collaborat 
negative, and increases as their number in 
tually becoming positive. 
....!' In support of this explanation, we note E: 
sensitivity analysis we tried regressions which $ se 
interaction terms multiplying the numbers © 
and collaborators for the squared terms. In ho 
sions, we obtain significant positive coeffice 
numbers of stars and collaborators and a $ 
ative coefficient on their interaction. ' 
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ach e SSON REGRESSIONS ON THE STOCK OF BIOTECH-USING FIRMS AT THE BEGINNING OF 1990 By REGION 
id incre: 
for 4 ai (a) (b) (c) d) ©) G 
Msc? 

that é O.911*** — 0.644*** —0.4687** = —2.595***  —2.718***  —2,607*** 
raving AI is “= " 0.014) (0.015) (0.033) (0.086) (0.256) (0.345) 
this aha ‘active at any 0.567*** 0.587*** 0.466*** — 0.877*** 0.649*** 
m modii P g 1976-80 (0.029) (0.072) (0.090) — (0.076) (0.084) 
ed that thea rators active —O.076***  0.175***  —0,183** = ~0.333***  —0,261*** 
ould EAN time during 1976- (0.012) (0.033) (0.068) — (0.045) (0.037) 
Siti x z 
1 acti a i vrs active at any e —0,028*** —0.019 BE —0.049%**  —0.024* 
38 are g ne g 1976-80)” — (0.007) (0.014) — (0.012) (0.012) 
oe again wa -+ Hiaborators active = —0.005*** . 0.002 = 0.007** 0.001 
l ae wi ‘me during 1976- eg . (0.001) (0.003) oS (0.003) (0.002) 
nerally edi ~ 4 = ality ee es 1.388*** = 1.504***  0,442* 
estes i ties in the region 5 2> (0.150) = (0.107) (0.195) 

cant.) a 3 
a for x faculty with federal — — 0.263 — 0.752*** 0.711*** 
umber di ; ‘sepa in the region — — (0.143) — (0.088) (0.051) 
regressione Senber Venture capital — — — 0.017***  —0.045***  —0.013** 
tially off asja the region in = a oz (0.002) (0.003) (0.004) 
a positive ga . pe 
ut reduce. (haw EN édployment (all — — — 0.222***  —0.009 —0.213*** 
1g mor atnes) in the region in — — . — (0.019) (0.043) (0.049) 
relativ 
ing collabora fäeg Wages per job in — — — 0.166*** 0.143*** 0.139*** 
ded activa i Feglon in 1980 — — = (0.004) (0.014) (0.019) 
3tars— - z births of biotech — — = = 5 0.300*** 
y than itis aas duting 1976-80 = = is = — (0.025) 
riteria. ~UR f ~871.9 -707.3 -543.2 -753.9 -416.0 -350.7 
oefficient: 0K ESE a 
t signifi g-likelihood —1401.7 —1401.7 —1401.7 ~1401.7 —1401.7 —1401.7 
tellect eee = 183. Standard errors (adjusted by Wooldridge, 1991 Procedure 2.1) are in parentheses below coefficients. 
early ELA Significantly different from 0 at the 5-percent level. 
n we exams Significantly different from 0 at the 1-percent level. 

Age : ignificantly different from 0 at the 0.1-percent level. 

Table 3), J i 
apne at USs-Section/time-series results just be- grants at universities.on birth of firms in a 
pia through zai tain (we think more reliable) zero given geographic region. The intellectual hu- 
bf these tab : coefficients on collaborators, so the, __ man capifal. variables_aione increase the log- - 
‘derivative o aT ely disappears. likelihood ratio from — 1401.7 to —543.2 [see 
eee ‘ec, fundamentals” model (excepting column (c)], with the final three variables 
mber a ssade-lagged dependent variable) is pre- bringing this quantity up to —416.0. As to the 
L, we noie DA umn (e) of Table 2, where all the last three variables, the quality of the labor 
jons Which Sg LASN are significant except that for total force, measured by average wages per job, 
fumbers j 8 Leaving aside the question of the seems much more relevant than its size. Sur- 
eS S =, 0orator coefficient, we note the prisingly, to some observers, the number of 
te ae ae a tive, Separate effects of stars, top- venture capital firms in a region enters, but. 

EEN. 7 “Versities, and federal research with a significantly negative sign. We interpret 
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the negative sign as evidence that venture cap- 
italists did play an active role in the formation 
of entrant firms, but they apparently resulted 
in fewer, larger firms being born in the areas 
in which they were more active.’ 

This sign of the coefficient of the number of 
venture capital firms in a region is robust in 
sensitivity experiments with, other forms (not 
reported here) except for regressions which 
exclude the, intellectual human capital vari- 
ables such as in column (d). That regression 
looks good in terms of significance and ex- 
pected sign pattern although it has a much 
lower explanatory power than the intellectual 
human capital variables alone [column (c)]. 
Just below, we report very similar results in a 
cross-section/time-series context. Thus, it is 
certainly easy to see why the evidence for an 
important positive impact of venture capital 
firms on the birth of biotech firms may have 
appeared stronger in previous work than seems 
warranted based on fuller models: Since ven- 
ture capital firms have developed around a 
number of great universities, their presence 
proxies for intellectual human capital in the 
absence of more direct measures; if they are 
the only variable indicating presence of great 
universities and their faculties, they enter pos- 
itively even if their packaging activities result 
in a negative direct effect on births. 

The decade-lagged dependent variable is 
added to the full fundamentals model in col- 
umn (f) of Table 2. Doing so primarily has 
the effect of weakening the significance of the 
top-quality universities variable (but, see the 
annual model below) due to significant mul- 
ticollinearity between the variables.'* One in- 


terpretation of this positive coefficient on the , 


lagged dependent variable is that agglomera- 


- This hypothesis was derived from anecdotal evi- 
.dence, but note that the top nine of Ernst & Young’s list 
of top-ten companies by 1993 market valuation (G. Steven 
Burrill and Kenneth B. Lee, Jr., 1994 p. 54) were located 
and founded in regions richly endowed with venture cap- 
ital firms: Boston (3), San Francisco (3), Los Angeles 
(1), San Diego (1), and Seattle (1). 

13 Jn the alternative version of Table 2 (see footnote 9 
above), the coefficient on the lagged dependent variable 
was nearly as large as in Table 2, so the significant positive 
coefficient does not arise from firms born 1976—1980 ap- 
pearing in both the current and lagged dependent 
variables. 








































tion effects strengthen the impact 
mentals on regional development. Hoyi.™ 
the statistical properties of poisson reenn a 
with lagged dependent variables are gq 
problematic so such regressions and 
timated standard errors should be yj 
cautiously. É 
. In.conelusion, the intellectual human & 
variables play a strong role in deter 
where the U.S. biotech industry deyi 
during the 1980’s. We have been able toi 
tify particular star scientists who a 
play a crucial role in the process of s 
and geographic agglomeration over an 
that which would be predicted based o’p 
versity reputation and scientists suppo 
federal grants alone. The strong positive’: 
of venture capital variable reported previous 
is not supported for firm births. Indeed; 
data tell us that there were fewer fir 
founded, other things equal, where there 
venture capital firms. It is left to futur: 
search to explore whether firms which até 
sociated with particular star scientists or W 
midwifed by venture capitalists are more 
cessful than other firms.’ i 


B. The Annual Model 


We next report analogous poisson rèi 
sions exploiting the panel nature of our d 
set with observations for the 183 regions 5 
each of the years 1976 through 1989. Tais 
3 and 4 report poisson regressions for this 
tire panel. i 

Column (a) of Table 3 reports the 
using only the counts of stars and theif 
orators active each year in each regio: 
with the long-run models in Table 2, c45. 
nation of the data suggested that these et 
particularly for stars—were nonlineat,2, 
add squared values in column (b). Ags 
the number of stars increases, their ma 
contribution diminishes eventually P 
through zero. E 

These nonlinearities might reflect the 
clining value over time of-the-intellec 
man capital as we have measured it. B 


14 See Zucker et al. (1994) for our first effort p, 
the determinants of success of firms after b 
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act GEROA : È i ; LANNY AL POISSON REGRESSIONS ON THE BIRTH OF BIOTECH-USING FIRMS BY REGION AND YEAR, 1976—1989 


SAU 
ent. Hoe 
on restate Q (b) ©) @ (e) () 
are § —1.591*** —1.918*** —2.148*** —4.447*** —4,.49 1*4 —4.687*** 
s and (0.032) (0.041) (0.057) (0.226) (0.349) (0.565) 
De x 
ld b s, TE $ctive in the region and 0.157*** 0.529*** O.270** . ww — _ -O.361** a 0.282** 
: . (0.020) (0,051) (0.088) —_ (0.080) (0.103) 
h E herstors active in the region  0.043+**  0,083* 0.047 = 0.013 0.032 
- det Man pote i aie (0.013) (0.035) (0.049) = (0.047) (0.052) 
itry ka active in the region and _ —0,022*** —0.014* — —0.015** —0.014 
n abl eat a — (0.0023 (0.006) — (0.005) (0.008) 
vho anno A Mlaborators active in the = -0.001 0.000 = 0.000 0.001 
ss of Sit year? = (0.001) (0.001) — (0.001) (0.002) 
PERE : 
over añ i active in the region and — — —0.219 — —0.298** —0.245 
based t DUMMY 1986-89 = = (0.113) _ (0.102) (0.128) 
s supporti cnllaborators active in the region — — 0.117 — 0.115 0.027 
g po : X DUMMY 1986-89 = a (0.067) = (0.064) (0.081) 
rted pre active in the region — — 0.006 — 0.009 0.007 
as. Ind Ex i DUMMY 1986-89)? = — (0.007) — (0.006) (0.008) 
2 gollaborators active in the i — — —0.001 — —0.001 0.001 
yere there a fe | and year x DUMMY 1986-89)" — — (0.002) — (0.002) (0.002) 
ft to futin Sde fop-quality universities in the = = 0.444*** ses 0.472"** —0,462*** 
s whic S zahl = s (0.125) — (0.095) (0.109) 
ienti akey faculty with federal support in — — 0.625*** — 0.982*** 0.930*** 
ientists’ = y 
s are mores ain 1979-80 — — (0.093) — (0.094) (0.093) 
; venture capital firms in the = = = 0.019%*  —0,028***  —0.024** 
and year’ — — — (0.007) (0.006) (0.008) 
ydel Eal employment (all industries) in the — — — 0.173***  —0.081 —0.117* 
pza and year =e = = (0.051) (0.048) (0.055) 
poisson, r at SSeS Wages per job in the region and — — — 0.153*** 0.125*** 0.132*** 
ure of ourga k Re — _ — (0.010) (0.016) (0.017) 
183 regione S če ratio (Standard & Poors — _ = —0.024 —0.026 -0.017 
sh 1989.5 pit ee: A — — — (0.016) (0.026) (0.039) 
ions fort active in the region at end - — — — = z 0.020 
year , oe a = ae an (0.013) 
sorts th : active in all U.S. atendof ° 3 — i = ies ans —0,000 
and theif i. a — — — — — (0.000) 
>ach regiongk: Aer bitech firms in the region for — — — — — 0.054 
Table 7 = — — — = (0.034) 
it thesee! { —1677.0 —1429.1 —1274.3 —1669.5 — 1202.3 —1184.6 
Tsi ikelihood . —2238.5 —2238.5 —2238.5 —2238.5 —2238.5 -2238.5 
a 
; their. mape 7 TR baer: errors (adjusted by Wooldridge, 1991 Procedure 2.1) are in parentheses below coefficients. 
2 Ik i i 1981, the number of venture capital firms in a region is held constant at the 1981 level to avert simultaneity problems. 
mtually:p2 anty different from 0 at the 5-percent level. 
ye w Š Hy different from 0 at the I-percent level. - orano tu e 
t reflec ta ie y different from 0 at the 0.1-percent level. 
2 intel 
redit. ` 


ledge diffuses we expect that more in birth of firms after 1985 than before. This 
Stars will result in less and less pay- is illustrated in column (c) of Table 3 where 
ort ne of them if he or she were to start we add four interaction terms in which these 
after bith EEE ou Indeed stars are less likely to result counts and their squares have been multiplied 
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by a dummy DUMMY1986-89 equal to 1 
during 1986—1989 and 0 otherwise, as well as 
the other intellectual human capital terms. 
During 1986-1989 the positive effect of stars 
is sharply reduced while that of collaborators 
more than triples.'> Nonetheless, we should 
view this inference cautiously since the sig- 
nificance values of the interaction terms for 
stars and collaborators with DUMMY 1986- 
89 fall between 0.10 and 0.05, except for stars 
in the full fundamentals model in column (e) 
where the stars interaction term is significant 
at the 0.01 level. | 

Thus, we see that (at least during the first 
decade of this industry) localities with out- 
standing scientists having the tacit knowledge 
- to practice recombinant DNA were much more 
likely to see new firms founded and preexist- 
ing firms begin to apply biotechnology. There 
is some evidence that as knowledge about 
gene splicing diffused and the tacit knowledge 
lost its scarcity and extraordinary value, the 


training function of universities became more _ 


important relative to the attraction of great sci- 
entists to an area. It is interesting that the quad- 
ratic term for stars is negative, suggesting 
diminishing returns (or possibly just propor- 
tionately fewer, larger firms) rather than the 
increasing returns suggested by standard 
views of knowledge spillovers which posit un- 
internalized, positive external effects from 
university scientists.'° In the same regression 
in column (c), we see that, beyond the iden- 
tified stars and collaborators, university qual- 
ity and federal support are also significant 
measures of intellectual human capital rele- 
vant to firm founding. 


15 To compute the effects of stars in the 1986-1989 
period, we need to add the coefficients of the number of 
active stars and the coefficient of the same variable inter- 
acted with DUMMY1986-89 and then do the same for 
the two terms involving the squared values of these vari- 

-ables. An analogous approach yields the effect of collab- 
orators during 1986-1989. We examined also interactions 
with dummy variables for 1976-1980 and with a time 
trend. Since the coefficients were very small and statisti- 
cally insignificant for interaction terms involving 1976- 
1980 dummies, we believe the reported form more 
accurately reflects the time or diffusion dependence than 
a negative trend throughout the period. 

16 We are indebted to Jeff Armstrong for this point. 






























Column (d) of Table 3 leads to the a 
conclusions with panel data as foung we 
same column in Table 2: The economie t/a 
ables enter significantly with the eXpectéd gam 
if the intellectual. human capital variable; 
omitted from the regression. However, 
the previous long-run case, We can now efg; 
the earnings-price ratio.” Here thig vane 
enters with the correct sign, but does not 
reach the 0.10 level of significance, +; 

Column (e) of Table 3 presents the an 
full fundamentals model incorporating thek- 
tellectual human capital and other variag 3 
The results for the intellectual human capiti 
measures are robust while the sign of the ¥& 
ture capital variable turns significantly nem : 
tive as in the long-run model an 
employment variable becomes i insigni 
(and negative). 

Column (f) of Table 3, analogously to få 
ble 2, adds a lagged dependent variable toe 
full fundamentals model. We also included de | 
one-year lagged regional and national coe He j 
of firms using biotechnology as dynamic inks: 
ences reflecting local agglomeration effo: 
and market competition effects, respectively: 
None of the three dynamic variables enter 2: 
nificantly although their signs are consist pe 
with some geographic agglomeration. P; 

Thus, taken‘ as a whole the results § 
marized in Table 3 support the strong fh 
intellectual human capital variables in &¢ 
mining the development of the American 4 
tech industry. KER A 

The role of the economic variables, parse 
ularly the number of venture capital ao 
the region, is explored further in Table 4, Te a 
table presents representative results for 11% $ 
in the entrant and incumbent subcomp 
of firm entry into biotechnology. We 
columns (a) and (b) that if only the eco 
variables are introduced we get 
pected signs at appropriate signifi 
[except for employment in (a) an 
eamings-price ratio in both], including 8 
consistent with conventional wisdom ! 
number of venture capital firms has 4 


7 The earnings-price ratio had to be cee a a 
these analyses because it is available only nations 
time. 
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TABLE 4--ANNUAL POISSON REGRESSIONS ON THE BIRTH OF BIOTECH-USING ENTRANTS 





AND INCUMBENTS BY REGION AND YEAR, 1976—1989 
(a) (b) (c) (d) (e) (f) 
Entrants Incumbents Entrants Incumbents Entrants Incumbents 
—4.726*** © ` —5,798**¥ ~4.845%** —5,673*** —4.928*** ~ 5 2IQ*H* 
(0.284) (0.563) (0.409) (0.902) (0.669) (1.285) 
; ative in the region — = 0.414*** 0.323 0.351** 0.242 
‘a = = (0.095) (0.165) (0.124) (0.169) 
2713 ors active in the = z —0.006 0.000 0.012 0.019 
panis = = (0.053) - (0.105) (0.059) (0.101) 
active in the region — — —0.016** ~0.016* —0.017 -0.015 
zy E = (0.006) (0.008) (0.009) (0.011) 
A arahorators active in as = 0.001 0.002 0.000 0.001 
and year)? & = (0.002) (0.003) (0.002) (0.003) 
active in the region — — —0.227* —0.519* —0.196 —0.456 
x DUMMY 1986—89 = — (0.113) (0.237) (0.147) (0.251) 
aborators active in the = = 0.096 0.233 0.011 0.144 
aad year X = 5 (0.071) (0.141) (0.090), (0.153) 
1Y 1986-89 
Shes stays active in the region = S 0.007 0.018 0.006 0.015 
sie x DUMMY 1986- = = (0.007) (0.010) (0.010) (0.013) 
aiie collaborators active in = = —0.00! —0.004 0.001 —0.002 
n Like pegin and year X — — . (0.002) (0.003) (0.003) (0.004) 
EAREMY 1986-89)? ! 
Seeker top-quality universities in e= = 0.440*** 0.479* 0.410** 0.447 
in 1981 = = (0.110) (0.205) (0.126) (0.238) 
alty with federal = = 0.973*** 1.114% 0.932*** 1.041%% 
the region in 1979- = A (0.112) (0.296) (0.107) (0.295) 
jefdure capital firms in 0.023** 0.006 —0.029*** —0.027* —0.024** —0.024 
aü and year’ (0.009) (0,013) (0.007) (0.012): (0.009) (0.013) 
vent {all industries) 0.128 0.296** -0.110 —0.052 —0.149* —0.078 
and year (0.067) (0.098) (0.058) (0.098) (0.067) (0.103) 
per job in the 0.156*** 0.139*** 0.123*** 0.113** 0.127##* 0.114** 
t (0.012) (0.024) (0.018) (0.039) (0.020) (0.040) 
tatio (Standard & -0.036 —0.033 —0.022 —0.056 —0.016 —0.082 
for year (0.021) (0.043) (0.031) (0.070) (0.046) (0.092) 
$Dh active in the region — — — _— 0.023 0.024 
ious year ax = = = (0.015) (0.025) 
active in all U.S. at — pe — — . —0.000 —0.00! 
mus year a E = = (0.000) (0.001) 
firms in the um — 2 ee 4. 0.037 0.055 
t ptevious year ine nd — en (0.041) (0.061) 
~1265.1 —486.3 ” 945,50" —386.8 ~~ -9358° ` —382.9 
ood _ —1628.7 ~607.9 — 1628.7 —607.9 — 1628.7 607.9 


t alira errors (adjusted by Wooldridge, 1991 Procedure 2.1) are in parentheses below coefficients. 
ee 1981, the number of venture capital firms in a region is held constant at the 1981 level to avert simultaneity problems. 
different from 0 at the 5-percent level. 
y different from 0 at the 1-percent level. 
uy different from 0 at the 0.1-percent level. 
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cantly positive effect on the birth of new firms 
but an insignificant effect on the birth of sub- 
units of existing firms which would not nor- 
mally be financed by venture capital firms. The 
full fundamentals model is reported in col- 
umns (c) and (d) for births of entrants and 
incumbents, respectively, which is to be com- 
pared to column (e) for all firm births in Table 
3. Again, in the-presence-of intellectual human 
capital the simple economic story does not 
hold up: the coefficients of venture capital 
firms and total employment turn negative, sig- 
nificantly so in the former case. Similar results 
are obtained in the dynamic versions of the full 
model reported in columns (e) and (f) of Ta- 
ble 4. The robustness of the negative venture 
capital coefficient remains a puzzle for future 
work, particularly in view of Yolanda K. 
Henderson’s (1989) evidence that, despite 
some significant localization, most invest- 
ments by venture capitalists cross regional 
boundaries. 


IV. Summary and Conclusions 


The American biotechnology industry 
which was essentially nonexistent in 1975 
grew to 700 active firms over the. next 15 
years. In this paper, we show the tight con- 
nection between the intellectual human capital 
created by frontier research and the founding 
of firms in the industry. At least for this high- 
tech industry, the growth and location of in- 
tellectual human capital was the principal 
determinant of the growth and location of the 
industry itself. This industry is a testament to 
the value of basic scientific research. The num- 
ber of local venture capital firms, which ap- 
pears to be a positive determinant when 
intellectual human capital variables are ex- 
cluded from the regressions, is found to de- 
press the rate of firm birth in an area, perhaps 
due to the role of these venture capital firms 
in packaging a number of scientists into one 
larger firm which is likely to go public sooner. 

We conclude that the growth and diffusion 
of intellectual human capital was the main de- 
terminant of where and when the American 
biotechnology industry developed. Intellectual 
human capital tended to flourish around great 
universities, but the existence of outstanding 
scientists measured in terms of research pro- 


_ cates or foreign and others of which h 


























ductivity played a key role over, abo 
separate from the presence of those 
ties and government research funding to 
We believe that our results provide pé: 
sight into the role of research univer; 
their top scientists as central to the fi 
of new high-tech industries spawned É 
entific breakthroughs. By being able tọ% 
titatively identify individuals with the ane ee 


breakthroughs, we have developed new 
ficity for the idea of spillovers and in pa 
raised the issue of whether spillovers 
viewed as resulting from the nonappro 
ity of scientific knowledge or from th 
mizing behavior of scientists who hay 
ability to appropriate the commercial frig 
their academic discoveries. 


DATA APPENDIX 


The data used here are generally in peg 
form for 14 years (1976-1989) and: 


going project on ‘‘Intellectual Capital, T 
nology Transfer, and the Organization 
Leading-Edge Industries: Biotechno. 
will be archived upon completion of the pr 
ect in the Data Archives at the UCLA Instit 
for Social Science Research. A full descrif 
of the data is available from the autho 
request. 


Biotechnology Firms 


The starting point for our firm data sé 
ered the industry as of April 1990 and3¥; 
purchased from NCBC (1991), a priv 
which tracks the industry. This data 
tified 1075 firms, some of which were Sy: 


or merged. Further, there were a sig 
number of firms missing which had 
prior to April 1990. For these reasons 
tensive effort was made to supplem 
NCBC data with information from 
(1989-1993) and an industry data 
vided by a firm in the industry which * 





























iJ 
istor of the Bioscan data set (Cetus 

aa y counted-entry of firms by add- 
= for each year and region the number of 
unded and incumbents first using 
logy. A few special cases should be 
Where a firm enters the data set due to 
"gerger of a entrant and another firm, we 
Xt for the purposes of this paper as a con- 
of the original entrant and not a new 





¥ fms already in the data set merge and one 
“ttes with the other(s) absorbed, the en- 
counted as the continuing enterprise 


j “gdsatified as described in the text. Individual 
wtentists are linked to locations through the 


genetic sequences is recognized by 
K's assignment to an article of a ‘‘pri- 
cession number’ to identify each. The 
ional stars added to the 315 with more 


Eo- : ajor journals listed in GenBank as 
Sieh enetic-sequence discoveries for 

© Of more of our 327 stars were listed 
(A small number of unpublished 
articles appearing in proceedings 
pea 4nd obscure journals were excluded 
ie icles were assigned unique article 
and collected by hand. For each 
atst ID numbers are used to iden- 
Order of authorship and the institu- 
“on and location for each author 
tle. This hand coding was neces- 
Under the authorship traditions 
ds;‘the head of the laboratory who 


t presticinne anthas feannanth: 
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the hand soding-detailed below.) All - 
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authors on 18.3 percent of the articles and last 
authors on 69.1 percent of the 4,031 articles 
remaining after excluding the 30 sole-authored 
articles.'* Unfortunately, only first- and/or 
corresponding-author affiliations are available 
in machine-readable sources.’ 

The resulting authorship data file contains. 
19,346 observations, approximately 4.8 au- 
thors for each of the 4,061 published articles. 
Each authorship observation gives the article 
ID number, the order of authorship, the sci- 
entist ID number of one of our stars and col- 
laborators, and an institutional ID number 
for the author’s affiliation which links him 
or her to a particular institution with a known 
zip code as of the publication date of the 
article. 


Citations 


We have collected data for 1982, 1987, and 
1992 on the total number of citations to each 
of our 4,061 published articles listed in the In- 
stitute for Scientific Information’s Science Ci- 

` tation Index (1982, 1987, 1992). These 
citation counts are linked to the article and au- 
.thorship data set by the article ID number. The 
citations were collected for articles if and only 
if they appeared in the article data set; so sci- 
entists are credited with citations only insofar 
as they are to the 4,061 articles reporting 
genetic-sequence discoveries and published in 
_major journals. 


Universities 


Our university data set consists of all U.S. 
institutions listed as granting the Ph.D. degree. 
in any field in the Higher Education General 
Information Survey (HEGIS), Institutional 
Characteristics, 1983—1984 (U.S. Department 

-of Education, National Center for Education’ 


` 


'8 This positional tradition holds across national bound- 
aries: As a percentage of articles coauthọred by their fel- 
low nationals, American stars are 16.4 percent of first 
authors and 71.2 percent of last authors, compared to 21.2 
percent and 63.1 percent, respectively, for Japanese, and 
19.7 percent and 69.2 percent for other nationalities. as 

19 The Science Citation Index lists up to six of the af- 
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Statistics, 1985). Each university is assigned 
an institutional ID number, a university flag, 
and located by zip code based on the HEGIS 
address file. Additional information described 
in the text was collected from Jones et al. 
(1982) for those universities granting the 
Ph.D. degree in biochemistry, cellular/molec- 
ular biology, and/or microbiology which we 
define as ‘‘biotech-relevant’’ fields. 


-~ ~+ ~ Research Institutes and Hospitals 

For those U.S. research institutions and hos- 
pitals listed as affiliations in the article data set, 
we assigned an institutional ID number and an 
institute/hospital flag, and obtained an address 
including a zip code as required for geocoding. 
No additional information has been collected 
on these institutions. 


Venture Capital Firms 


We created a venture capital firm data set 
by extracting from the Pratt (1982) directory 
the name, type, location, year of founding, and 
interest in funding biotech firms. This infor- 
mation. was extracted for all venture capital 
which were legally permitted to finance start- 
ups. This latter requirement eliminated a num- 
ber of firms which are chartered under 
government programs targeted at small and 
minority businesses. This approach accounts 
includes founding date of firms appearing in 
the 1982 Pratt directory, excluding those firms 
that may have either entered thereafter or ex- 
isted in earlier years but exited before the di- 
rectory was compiled. 


Other Economic Variables 
Total employment and average earnings 


per job by region and year are as reported by 
the Bureau of Economic Analysis based on 


county level data in U.S. Department of ~ 


Commerce (1992b): Total employment is 
from Table K, line 010 (in millions of per- 
sons). Average earnings is from Table V, 
line 290 (wage & salary disbursements, 
other labor income, and proprietors income 
--per-job in thousands of current dollars), de- 
flated by the implicit price deflator for per- 
sonal consumption expenditures. The annual 


ES ee ee ae 
Arrow, Kenneth J. ‘“Economic Welfare a; 




























data for the implicit price deflatg. fe ie 
sonal consumption expenditures w 

from U.S. Department of Commerce 
p. 247, line 16) as updated in the Jy 
Survey of Current Business, (p. 92, }j 
The S&P 500 earnings-price ratio w: 
from CITIBASE (1993), series FSEXp 
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E amic competitive models are interesting 
y Enterpris i for studying industry behavior over time. 
a wh there is some overlap, such models 
ngtry evolution can be divided into two 
; earning (or internal shocks) models 
a extemal shocks models. 
ning models focus on idiosyncratic 
Seeks that are internal to the firm. For ex- 
oyan Jovanovic’s (1982) passive 
ng model posits that firms naturally ac- 
noisy information about their efficiency 
y produce. Firms that receive too many 
arable signals conclude they are ineffi- 
Gea, are scrapped, and are replaced by new 
aats, Richard Ericson’s and Ariel Pakes’ 
4295) active learning model focuses on re- 
seh and development components of in- 
try evolution. Firms make investment 
éiaions knowing that their future productiv- 
4 stochastic function of their current pro- 
fity and their current level of investment. 
kfm. that fall too far behind in the technology 


eaming models—see also Hugo 
openhayn (1992) —external shocks 
mphasize common shocks faced by 
HS; Examples of external shocks in- 
fe and shocks, factor price shocks, 
change rate fluctuations. If firms are 
5 assumed by Avinash Dixit (1989) 
übson (1992), then they will be 
‘affected. If firms differ, as in 
£ (1991) model of endogenous 
: oe then they will be affected 
v=i¥atlous models of industry evolution 
ically inconsistent, rather they 


of Economics, Brigham Young Univer- 
-- We are grateful to the Silver Fund 
acquisition of the COMPUSAT data and 
or helping us access it. We also thank 
Julie Elston, Mark Roberts, and two 
: rees for their suggestions, as well as 


sunk Costs and Firm Value Variability: Theory and Evidence 


" By VAL EUGENE LAMBSON AND FARRELL E. JENSEN* 


focus on different phenomena. The result- 
ing theoretical insights generate several 
predictions that a small body of empiri- 
cal work on industry dynamics has ad- 
dressed. For example, Timothy Dunne et 
al. (1989) document the importance of 
entry and exit while pointing out some 
intriguing empirical regularities that seem 
consistent with the dynamic competitive 
approach. Pakes and Ericson (1998) con- 
trast some empirical implications of passive 
and active learning models and conclude 
that passive learning is more important 
in retailing whereas active learning is 
more important in manufacturing. Lambson 
and Jensen (1995) argue that agricultural 
data are consistent with external shocks 
models. 

This paper reviews the predictions of dy- 
namic competitive models regarding the vari- 
ability of firm value and subjects them to 
empirical testing. Learning models predict that 
intra-industry firm value variability will be 
greater in higher sunk cost industries. External 
shocks models predict that intertemporal firm 
value variability will be greater for firms in 
higher sunk cost industries. 

The empirical results are consistent with 
both the learning models and the external 
shocks models. Specifically, stock market 
prices and publicly available debt positions 
provide estimates of firm value while the 
book value of property, plant, and equipment 
(capital costs) serves as a proxy for firm- 
level sunk costs. Regressions of firm value 
variability (whether intra-industry or inter- 
temporal) on industry averages of capital 
costs yield significantly positive coeffi- 
cients. Controlling for firm size does not al- 
ter the conclusions. 

Section I reviews the implications of 
dynamic competitive models pertaining 
to firm value variability. Section II outlines 
the empirical framework and describes the 
data. Section III contains the empirical 
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I. Some Implications of Dynamic 
~ " Competitive Models 


Formal descriptions of various dynamic 
competitive models are in the papers cited 
in the introduction; here a less formal dis- 
cussion will suffice. Dynamic competitive 
models endow expected-present-value- 


maximizing firms with infinite horizons and — 


place them in an uncertain environment 
where some actions entail sunk costs. Spe- 
cifically, at the beginning of each of count- 


ably infinitely many periods, firms observe 


current conditions and then make their entry 
and exit decisions. A new firm enters if its 
expected present value of being in business 
(given an optimal exit rule) exceeds its entry 
cost. Similarly, an existing firm exits if its 
expected present value of remaining in busi- 
ness (and optimally exiting later) falls below 


_ its scrap value. Firm i’s entry cost and scrap 


value will be denoted ŒE; and S; , respectively. 
Their Hieierence; E; — S;, is firm i’s sunk 
cost. 

After firm i enters, it is buffeted by 
events that affect its profitability. These 
events are characterized by the three sto- 
chastic processes— y; , ¢, and y—described 
below. The ż™ period realizations of y;, 
y, and y are denoted y,,, y,, and y,, 
respectively. 

The random variable y;,, emphasized by 
learning models, is idiosyncratic to firm i. It 
incorporates internal shocks and any other 
heterogeneity. Jovanovic (1982) would in- 
terpret y, as a random signal that depends 
positively on firm i’s exogenous and unob- 
served efficiency: firms with higher y,, val- 
ues believe they are more efficient and thus 
have higher expected values. Ericson and 
Pakes (1995) would interpret y;, to be firm 
i’s efficiency as a function of past R&D suc- 
cesses: the current level of efficiency y,,is a 
stochastic function of the previous level of 
efficiency y;,-, and the previous R&D in- 
vestment decision. l 

The random variable y,, emphasized 
by external shocks models, is common 
to all firms. Dixit (1989) would in- 
terpret y, as factor and output prices, 
which evolve exogenously. Lambson (1991, 
1992) would interpret y, as factor prices, 


present) value of firm i at time t. The fi 





























prices.’ 
Finally, the random variable y, d 
number of existing firms and their ch: 
istics after entry and exit decisions haya 
implemented in period ż. The value ¢ of. 
pends on y,_, and on firms’ reactions tg 
and y; realizations. 
Let Vi Vito Pr», yr) denote the (ex 


definition of V; differs across the various 
els, but in each case it can be derived from 
solution to a dynamic programming 
that specifies the firm’s optimal producig 
and exit rules given the stochastic o 
and the equilibrium behavior of the 
firms. Dynamic competitive models make 
dictions about both intra-industry and 
temporal firm value variability. Intuitively, 
basic insight is that entry and exit put 
and lower bounds on firm values: hi 
values tend to provoke entry and low 
ues tend to provoke exit. Where sunk co = 
larger, however, firm values can rise 
without provoking entry and fall more withesé: 
provoking exit. Thus, the range of firm yaaa] 
should be larger where sunk costs are highéé| 
Predictions about intra~industry fimi 
variability arise from differences in yı Æ 
firms, and hence are naturally associated ¥# 
learning models, Let J, be the set of fimti% 
industry 7 at time t. To begin simply, supè} 
that all firms in and potential entrants tō 
dustry Z have the same entry cost Ë and. 
same scrap value S,. Further assume 
firms with the highest values can be 
imitated by entrants—so that maxes 
E,—and that the firms with the lowest 
are indifferent between exiting and re! 
in business—so that min; ¢,, Vi = Sr 
these assumptions, the range of firm V 
industry I at time t—defined by Ri 
max; e 1, Va — min; e n Vi-—satisfies ` 


(1) R:(1, t) = E — Sı 


! Here there is some overlap with ase i 
Lambson ( 1991) would interpret y; as 2 const? 
dexes firm i’s technology choice. Firms that 
different technologies are affected differently 


in y. 











































senizeg 1) predicts that regressing intra- 
? firm value variability R; (7, £) on the 
y, descr sunk cost Er — Sı will yield a co- 
‘heir chara of-one. : 
ions haye & nately, the assumptions underlying 
value (1) are stringent ones. To see the ef- 
actions to a: elaxing them, reinterpret E; and S; as 
= gverage entry cost and average scrap value, 
the (exp tively, of incumbents in industry J. Add 
et. Th industry /’s average sunk cost, E; — 
he vari cigs gewtite RU, t) = maxes Vi 
derived 
mming p : B es ote 
imal prody (I;.t) = (max V;, — Er) + (E — Sı) 
hastic ie 


or of os = 

1odels make aa — (min Va — 57). 

lustry anda l eH 

y. Intuiti fight-hand-side term is how much the 


sful firm’s value exceeds the av- 
try cost. The second term is the av- 

sink cost for existing firms. The third 
afelleċis how much the least valuable 
a's Value exceeds the average scrap value. 
ven the assumptions underlying equation 
#1), the: first and third right-hand-side terms 
ön (2) are zero and equation (2) re- 
quation (1). More generally, the first 


mees in be of either sign. (It will be positive 
ly assoc ‘good and the most fortunate in- 
he set. Joys inimitable advantages due to 
n simp outcomes, access to unique fac- 


jal entraft ause potential entrants have high 
ry cost. ‘will be negative if times are so 
er assum the most successful firms have 
1es can Similarly, the third term may be 
hat max; nx FSign. (It will be positive, for example, 
1 the lo * Eems have the same scrap value and the 
ting an € firm strictly prefers to remain in 
an Vi be negative, for example, if 


uitable firm has a lower than aver- 
ue and is close to indifferent be- 
and'retndijiing.) As long as the 
terms are uncorrelated with the 
wever, equation (2) predicts 
on of R; (1, t) on E, — §; will 
© coefficient: the omitted vari- 
2 ~ E, and min; e; Va — S; will 


m pi mto the constant and the error 
T T these assumptions, the the- 


: a duel Rta A seoa higher intra-industry range of 


fher ennb anne I d-t 
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Predictions about intertemporal firm value 
variability arise from differences in y, over 
time and hence are naturally associated with 
external shocks models. Let R,(7) be the range 
of firm i’s value over time, that is, R,(i) = max, 
V;, — min, V;,. In the absence of intra-industry 


heterogeneity—that is, when yy is the same: 


for all i € J, and, furthermore, all firms in and 
potential entrants to industry Z have the same 
entry cost E; and the same scrap value S,— 
equilibrium requires | 


(3) E = V;, = $, 


for all i € J, and for all t because entry prevents 
V; from exceeding the entry cost and exit pre- 
vents V;, from falling below the scrap value. If 
firm i is observed long enough so that there are 
both entry-provoking good times (when V;, = 
Er) and exit-provoking bad times (when V; = 
Sz), then expression (3) implies 


(4) R,(i) = E- S. 
Specifically, regressing intertemporal firm 


value variability R,(i) on the industry’s sunk 
cost E; — S; should yield a coefficient of one. 


Intra-industry heterogeneity—reflected in 


different y; processes, different realizations of 
a given y; process or differences in entry costs 
or scrap values—-weakens the predictions of 
equation (4). (For example, the values of in- 
imitably efficient firms may rise above average 
entry costs and fail to fall to average scrap val- 
ues.) To allow for heterogeneity, reinterpret E; 
and $, as the average entry cost and average 
scrap value, respectively, of incumbents in in- 
dustry J. Add and subtract industry P s average 
sunk cost, E; — S;, to rewrite R,(i) = max, Vy — 
min, V;, as 


(5) R,(i) = (max Vy — E) + (E, — 57) 


a (min Va T Si). 


In the homogeneous case the first and third 
right-hand-side terms are zero, as argued 
above. More generally, if intra-industry het- 
erogeneity is additive over time, so that a more 


(respectively, less) efficient firm is always ` 


a Ramntamnét than n 


wee ee FPL UN wenden btn hee 


a 


pee 
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numeraire firm whose value fluctuates be- 
tween E; and S,, then the first and third terms 
cancel and the results are the same as for the 
homogeneous case. Unfortunately, this addi- 
tivity assumption is still very stringent. How- 
ever, as long as the first and third terms are 
uncorrelated with the second term, regressing 
R,(i) on E, — S, should yield a positive coef- 

+ficient: the omitted- terms will simply sweep 
into the constant and the error term. 

In summary, under the assumptions, dy- 
namic models of industry evolution predict 
that intra-industry firm value variability is 
higher in high sunk cost industries and that 
intertemporal firm value variability is higher 
for firms in high sunk cost industries. 


U. The Empirical Framework and the Data 


Direct tests of the empirical implications 
discussed in Section I require data on sunk 
costs and firm values. Unfortunately, detailed 
data on sunk costs seem to be unavailable and 
detailed data on firm values are difficult to ob- 
tain except for publicly traded firms. However, 
by focusing on established firms and using a 
proxy for sunk costs, one can construct simple 
tests of the implications of dynamic competi- 
tive models for firm value variability. 

The data used in the study are from four files 
of Standard and Poor’s Compustat Services: 
the Primary, Supplementary, and Tertiary In- 
dustrial Files and the Industrial Research File. 
The Primary Industrial File has information on 
a sample of companies from the S&P Indus- 
trial, Utilities, Transportation, and Financial 
Indexes. The Supplementary Industrial File in- 
cludes information on additional companies 
listed on major stock exchanges. The Tertiary 


_ File contains information on banks, life in- 


surance companies, railroads, property and 
casualty insurance companies, real estate in- 
vestment trusts, and additional companies 
from the S&P Utilities, Transportation, and 
Financial Indexes. Finally, the Industrial Re- 
search File contains companies that have been 
deleted from other Compustat files because of 
acquisition or merger,-bankruptcy, leveraged 
buyout, reverse acquisition, or because the 
company no longer files with the SEC. The 
data, reported annually, cover the 20-year pe- 
riod 1973-1992 for 4,534 firms. The largest 

































firm had an average value over the 
riod of 363 billion dollars while the 
firm had an average value over the & 
riod of 2.8 million dollars. The 
ported average book value of pro 
and equipment was 118 billion d 
the smallest average book value wag yes 
to be zero. 

The dependent variables sugges 
models are the intra-industry and 
poral ranges of real firm value. Firmy 
millions of dollars) was defined as’ the’; 
market value of firm equity (that is, 
year-end price of a share of comm 
times the number of shares ne ral wee 
end of the fiscal year) plus the real boot 
of total liabilities, where real values | 
culated using the GNP deflator with 
the base year. (Although it would h 
preferable to have the market value of 


difference between the highest firm val 
the lowest firm value within a four-dig 


zero were discarded. (For the intra-inds 
range these correspond to industries for 

only one firm was reported for the yeah Pog 
the intertemporal range these correspo & | 
firms that were in the sample for only. 
year.) 


firm values will understate its theoretical 
terpart to the extent that a 
(e.g., new firms) are not publicly traded 
not in the sample for other reasons. LP. 
more, the four-digit SIC codes may BY sie 
fectly coincide with the theoretical notita, 
an industry. Even so, the measure 
good proxy for its theoretical coun 
the extent that the biases are not syst¢ 
The measure of the intertempo 
firm value almost certainly under Fy 
theoretical counterpart because the as we a 
that the observed firms have enjoy, 
good times and bad times (with V: * 
to their upper and lower extremes) 4 aut $ 







































° n? BS 
e. Firmal ‘hon 
fined asthe Pny 
(that is, the Gps 


e real book yay ssaple period may not be satisfied, especially 
| values WEC EPEE -i= firms that are only in the sample for a short 
EDn Nevertheless, it should be positively re- 
$ <kged to its theoretical counterpart and the un- 
timate should become smaller as the 
kash of time a firm is observed increases. 
“The independent variable suggested by the 
ary is the average sunk cost in each indus- 
Unfortunately, such costs are difficult to 
“assure, We used the real gross book value 
ia fulllions of dollars) of property, plant, and 
ment (capital costs) as a proxy for firm- 


d each year by using the GNP deflator 
41987 as the base year. In the learning 
context, we used the intra-industry av- 
‘of this proxy as a proxy for the average 
St in each industry. 

intertemporal context—in contrast to 
plifying theoretical assumption that 
-COStS are stable over time—the data re- 
that firms’ Capital costs change over time 
‘8608 build new plants, retool old plants, and 
AE Scrapped equipment from their books. 
toothed” the data by using the average 
sts for each firm over time as a proxy 
level sunk costs. We then used the 


etage sunk cost in each industry. 

d, capital costs probably over- 
Costs in two ways: they ignore de- 
zu iand they ignore scrap values. 

that depreciates away is no longer 
depreciation is difficult to mea- 
> available estimates are based on 
€finitions that have little eco- 
+ fing: Scrap values are even harder 


stry average of this proxy aS.a proxy . 
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Variable Coefficient 
Intercept 1,958.37, 2. 
K 9.71 
Intercept 1,625.21 

K 1.21 


t 


TABLE 1—SuNK COST REGRESSIONS 
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Standard error R Observations 
211.76 0.34 6,266 
0.17 
138.56 0.04 4,209 
0.09 


the learning model an observation corresponds to an industry in a given year, the dependent variable is the 

range of real firm value and Ķ is the intra-industry mean of firm-level capital costs. For the external shocks 

‘observation corresponds to a firm, the dependent variable is the intertemporal range of real firm value, and K 
„industry mean of the intertemporal mean of firm-level capital costs. 


to measure: one must know what the equip- ` 
ment and other embodiments of sunk costs 
would be worth outside the industry. On the 
other hand, capital costs probably understate 
sunk costs in other ways: other costs than cap- 
ital costs— such as the opportunity cost of the 
entrepreneurial time used to start a company, 
legal fees, etc.—are also sunk. We do not 
claim that capital costs are a good measure of 
sunk costs, only that, in the absence of good 
firm-level data on sunk costs, they are a good 
proxy: other things equal, firms with higher 
sunk costs probably have higher capital costs 
and the converse. 


Il. Empirical Results 


Learning models predict that regressing the 
intra-industry range of firm value at a given 
time on a proxy for the industry’s average sunk 
cost should yield a positive coefficient. Exter- 
nal shocks models predict that regressing the 
intertemporal range of a firm’s value on a 
proxy for its industry’s average sunk cost 
should yield a positive coefficient. The results 
of these regressions are in Table 1 and are con- 
sistent with the theory: the coefficient in each 
case is positive and highly significant. 

These results, however, do not distinguish 
the sunk cost theory of firm value variability 
from other conceivable size-related theories. 
Specifically, suppose that production in indus- 
try J requires a sunk cost of a; per worker. If 
L; is the number of workers employed by firm 
i then firm i’s sunk cost is E; — S; = Lio, and, 
other things equal, is higher for larger firms. 
This suggests an alternative interpretation of 





312 THE AMERICAN ECONOMIC REVIEW 


TABLE 2—SIZE REGRESSIONS 


Model Variable Coefficient 

Learning Intercept 1,120.42 
Range (L) 140.99 

External shocks Intercept 731.88 
L 168.08 


Notes: For the learning model an observation corresponds to an industry in a given year, the dependent y. 
intra-industry range of real firm value and Range (L) is the intra-industry range of firm-level employment, eae x 
shocks model an observation corresponds to a firm, the dependent variable is the intertemporal range of real 
and L is the intertemporal mean of firm-level employment. 


TABLE 3—-NORMALIZED SUNK COST AND SIZE REGRESSIONS 


Model Variable Coefficient 
Learning Intercept —2,648.91 
KIL 94.28 
Range (L) —9.95 
External shocks Intercept — 685.75 
KIL 11.56 
L —0.26 


Notes: For the learning model an observation corresponds to an industry in a given year, the dependent variable E 
intra-industry range of real firm value per worker, K/L is the intra-industry average of firm-level capital costs péf 
and Range (L) is the intra-industry range of firm-level employment. For the external shocks model an ob 
corresponds to a firm, the dependent variable is the intertemporal range of real firm value per worker, K/L is 


Standard error ` P 
226.12 0.31 
2.68 — 
115.30 0.35 
3.70 — 






Standard error R 
1,213.04 0.15 

2.86 — 

14.12 — 
679.23 0.07 

0.69 — 

20.94 — 


industry mean of the intertemporal mean of firm-level capital costs per worker, and L is the intertemporal mesfi ei : 


level employment. 


Table 1. Table 2 shows, not surprisingly, that 
intra-industry firm value variability is posi- 
tively related to intra-industry firm size vari- 
ability and that intertemporal firm value 
variability is positively related to firm size 
(where firm size is measured by employ- 
ment). Thus Table 1 does not rule out the pos- 
sibility that sunk cost is merely a proxy for 
firm size variability (in the learning context) 
or firm size (in the external shocks context) 
which, in turn, are related to the true deter- 
minants of firm value variability. 

Fortunately, the sunk cost theory of firm 
value variability can be distinguished from 
„other size-related theories because the predic- 
tions do not change when one.controls for size: 
sunk costs can be normalized by setting L; = 
1 in the previous paragraph and focusing on 


` a proxy for firm size variability in the 














sunk costs and firm value per work 
associated normalized regressions 
strongly positive sunk cost coefficient, 
this method of controlling for firm size B 
arguably inadequate. Since higher sut oe 
industries exhibit higher firm size va rie 

in the data, the sunk cost variable mS á = ; 


malized learning regression. Simil 
large firms are more capital intensive 
data, the sunk cost variable may 
proxy for firm size in the normali 
shocks regression. For these reasons, 


-° - - vaa 


? This is analogous to our previous work on # 
Lambson and Jensen (1995) —-which can be 
as normalizing with respect to land inputs. - 





































3 firm size variability and firm 
ay ured by the intra-industry range 
E mel employment and firm-level em- 
respectively) directly to the respec- 
alized regressions. As shown in 
-@ the sunk cost coefficients remain 
iy positive whereas the size coefficients 
act to Table 2) become insignificant. 
gests that firm value variability is ex- 
by sunk costs rather than other size 
ariables. 
farther evidence, note that the theory 
that the ranking of industries by their 
ustry firm value variability and the 
of industries by the average intertem- 
firm value variability of their firms 
be similar. Consistent with this, the 
gaman correlation coefficient for the two 
; pakiss (of 334 industries) is 0.88. 
feported regressions are very simple. It 
örant to emphasize that the theory sug- 
3 that they should be: the theory leads one 
šas on sunk costs as a major determinant 
tange of firm value. Whether it is caused 
t demand shocks, input price shocks, 
@anccs-in regulatory regimes, random out- 
m24°0f research and development, or 
He else, the range of firm value is con- 
by the difference between entry costs 
P values. Hence, a simple regression 
ge of firm value on a measure of sunk 
{precisely what the theory calls for 
‘variables can be accurately 


lude that sunk costs are an impor- 

lant of firm value variability, as is 
iby dynamic competitive models of 
lution. . 
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By IAN J. IRVINE AND WILLIAM A. SIMS * 


A standard problem in welfare analysis is 
that estimated demand functions, or labor sup- 
ply functions, which are sufficiently flexible to 
capture the variation in behavior observed in 
data, frequently cannot easily be integrated 
back to an explicit cost or utility function, de- 
spite satisfying the integrability conditions.’ 
Some functions are easily integrable to yield 
closed forms, but impose unreasonable restric- 
tions on the data. The consequent trade-off be- 
tween the generality of the econometric 
functions and the possibility of obtaining the 
corresponding cost or utility function has been 
examined exhaustively by Nicholas Stern 
(1986); in the context of labor supply. 

Recent practice, when a closed-form utility 
or expenditure function cannot easily be ob- 
tained from estimated, but well-behaved, de- 
mand functions,’ is to use numerical methods 
to approximate welfare measures (Yrjö 
Vartia, 1983; Kathy Hayes and Susan Porter- 
Hudak, 1987; Breslaw and Smith, 1995; 
Hausman and Whitney Newey, 1995). Be- 


* Department of Economics, Concordia University, 
Montreal, Canada, H3G 1M8. Irvine would like to thank 
the University of Sydney for the use of research facilities 
while working on this paper. 

' The integrability problem involves the conditions un- 
der which we can be sure that observable demands are the 
result of a utility-maximizing process. The sufficient con- 
ditions are called the integrability conditions and are de- 
veloped in Leonid Hurwicz and Hirofumi Uzawa (1971). 
In essence they are the conditions for a well-behaved de- 
mand system: that the demand functions are homogeneous 
of degree zero, add up, and have symmetric, negative 
semidefinite substitution terms, in addition to certain reg- 
ularity conditions. Even where these conditions are ful- 
filled, it may be-very difficult to find closed forms for cost 
and utility functions. The technique suggested in this pa- 
per can be used in such cases. 

? Jerry Hausman (1981 p. 666) shows, for a single 
price change, ‘*... that for many applications no ap- 
proximation is needed.” But as is pointed out in Jon 
Breslaw and J. Barty Smith (1995 p. 313), “... since 
this involves solving differential equations, Hausman’s 
technique becomes difficult to implement when the de- 
mand functions are complex or when there is more than 
one price change.” 



























cause of the inherent complexity of sy 
ods, however, various single-step p 
have alsọ been employed. One such 
is the Marshallian consumer surplus; 
can be used to approximate either 


cing 


A 


tive approach involves employing ą. 
approximation of either the indirect į 
function (George McKenzie and Ivor 
1976) or the cost function ( Andreii Ná MAEN- 
Colell et al., 1995 p. 89). The latter technige MAASKESI |: 
assuming a second-order expansion, iiye AMERS: 


initial prices as an approximation to thé 
but unknown, Hicksian demand when‘ 
lating the welfare change. Mas-Colell ¢# 
(1995 p. 90) suggest that this linear apog. 
mation to the Hicksian demand is both shee’ $ 
to apply and, in the case of small pay 9E 
changes, more accurate than the Marshalet 
consumer surplus as a measure of wèlla $ 
change. + 
In this paper we propose an alternative ë 
step procedure, which is at least as simp 
the two previously mentioned one-step = 
proximations (i.e., the linear Hicksis 
proximation and the Marshallian c 


procedure is based upon what is ter 
Slutsky compensated demand (or supply 
labor case), involves no numerical inte, 
and requires only an understanding of ¢! 
tary utility theory. ii 

In Section I we illustrate the ides! 
means of a geometric example. In Se 
we apply the method to a specifi 
behaved labor supply function W9 
cost and utility functions are rec 
only by means of numerical expansiom®: 
then examine- accuracy by conside 
where the cost function is easily obt! 
In Section III we develop the the 
properties of our measure. We sho 
is locally path independent and d 
sults on relative errors which Sufi 
findings from the numerical examp 

























tion IV, we relate our approach 
fHausman and Newey (1995). 


i tsky-Based Compensation 





microeconomics texts (e.g., Milton 

1962, or Eugene Silberberg, 1990) 
teise is to examine the relation- 
| Hicksian and Slutsky compen- 

curves. Figure 1 is typically 
xercises. We shall use the term 
sation to denote the income 
A required so that a consumer can 
dle of goods purchased prior to 
The amount of a good pur- 
n set of prices with such com- 
me Will denote by x5 = x‘(p, x°) 
ector of goods which the con- 
at the initial price vector p°. In 
> Compensation is the amount re- 
Particular level of indiffer- 
ted demand is x” = x”(p, 
As the initial level of utility 
farshallian demand will be 
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Quantity 


FIGURE 1. THE SLuTsky DEMAND 


denoted by x” = x™ (p, y) where y is income. 
The Slutsky demand is derived by substituting 
the vector product px° for y in the Marshallian 
demand. For a normal good, x, the relationship 
between these demands (Jacob Mosak, 1942) 
is that the Slutsky (S) is above, but tangent to 
the Hicksian (H) demand at the point where 
they intersect the Marshallian (M) demand*— 
point a in Figure 1.* 

When the Hicksian demand is known, the 
area to the left of the Hicksian demand 


3f x were inferior the Marshallian demand would be 
the steepest of the three demands and the Slutsky demand 
curve would lie everywhere beneath the Hicksian curve, 
except at the point of tangency. A technical derivation of 
the relationship between Hicksian and Slutsky demands is 
provided in the Appendix at the end of this paper. 

* The demands in Figure 1 come from a Cobb-Douglas 
utility function u = x,x, with a budget constraint x, + x, = 
8. The proximity of x* to x” is thus not simply constructed 
to support our approach. All figures are developed with 
Maple V R3 using exact functions, and are presented to 
scale. 
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represents the monetary value to the consumer 
of a price change. When this demand is de- 
fined for the initial level of utility, u°, the re- 
sulting measure is the compensating variation, 
CV, and when it is defined for the new (post- 
price change) utility level it is the equivalent 
variation, EV. When the Hicksian demand is 
unknown an approximation iş required. Since. 
the Slutsky demand is always obtainable from 
the Marshallian demand, our proposal is to use 
the Slutsky demand to approximate these mea- 
sures. In the case of a single price change this 
amounts to computing the area p°a fp’ 
Figure 1 as an approximation of the unknown 
area p°adp' when price falls from p° to p'.° 

To examine the relationship between the 
Slutsky-based measure of the CV and other 
methods, it is convenient to define the CV 
mathematically. Letting e denote the expen- 
diture function, the CV is given by 


(1) CV= e(p°, u°) — e(p', u’), 


where p is a vector of prices at the initial (p°) 
and final (p’) equilibria. Expanding e(p’, u?) 
around the initial price and utility combination 
by means of a Taylor series, and considering 
only one price change, we obtain 


o ,,0 
(2) e(p’,u°)=e(p®, u°) Je uw), 
Op 
2 0 
+0 5 IEP W) Ara S 


Op 2 
where R, is the remainder term in the series, 
and A is the price change (p’ — p°). If the 
quadratic terms alone form a good approxi- 
mation, then 


(3) CV = —x"(p®, w)A 


H 0 0 
5 ox (Pi u’) aa 
Op 


since the derivative of the expenditure func- 


tion is the Hicksian demand. ôx” (p°, u°)/Op ... 


5 To avoid repetition, we develop the theory using the 
CV only, although John Kay (1980) has noted the advan- 
tage of the EV over the CV. 























can be evaluated by the Slutsky Sh 


small changes in p. That.is: a, 
Ox" (p°, u 

(4) e °) 

% P 


Mry0 4,0 
wath aa: ee 


and this gradient is therefore obis% 
from a knowledge of the parame! i 
Marshallian demand function. This 4 
then, which is essentially what is 
Mas-Colell et al. (1995 p. 89), amo 
ing the tangent (Z) to the Hicksian 
Figure 1 at the initial equilibrium and 
imating the area p°adp’ by p°agp’: 


I. Application 


A desirable characteristic of labo 
functions is that they display flexibility exit 
different ranges of the wage w and une $ 
income m. This is what makes the linest bag px 
supply model of limited value for tax Rez AES 
analysis—particularly at the extremes f & $ 
wage and unearned income values (Mars 
King, 1987). We define the past t 


L is labor supplied and c is consumpti i 
oe 


A. A Quadratic Labor Supply Funciiée 


and unearned i income, m. 
(5) L = aw + Bm + dw’ 
+ pm? + vwm + Y- 


This function permits L to be e backward je ae 
ing at high wage rates—if à < 0. nae i 

aL /dm < 0 (e.g., more BMWs make it 

P AEREN less P Thes 


é The third one-step procedure, Marshals 
surplus, is measured as the area to the 
Marshallian demand (M). In Figure 1 this isr 
by the area p°acp’. 













‘to recovering the utility function 
demands or supplies is first to 
nditure function, then the in- 
be Sion by inversion. The direct 
-y fonction (if required ) is obtained by dif- 
ting the indirect utility with respect to 
lized price vector, and substituting 
, Following Hurwicz and Uzawa 
}the expenditure function can be ob- 
‘where possible) by integrating the par- 
erential equation 


that is presén Om __ L 

9), amounts foak.. 4 € aw ? 

licksian iy , 

brium an ‘erem is the expenditure function. For p = 


) this is feasible, and yields an indirect 
waity (Stern, 1986 p. 175) 


ion Bee NE 
¥F) “iv(w, m) = me™ +03 — O(w) 

ic of labore ae 
lay flexibil ` 
ge w and \ 
akes the lines 
ralue for tax ae s 
the extre r w+ 0.5vw? 
ne values ;! f; ((~aw- dw? — yje™ +°} dw. 
he underlyi ’ 
,), and the ip above is nonzero, recoverability be- 
n = c — WEA 3 complex as we examine below. How- 
is consumiptiong wz, the ‘Slutsky-compensated labor supply, 

l £2n be obtained easily from (5) by substi- 
r Supply F i -using the relation m = c? — wL?. 
u ply fun uate. ar, 
adnate W(a — pL? — 2uc°L® + ve?) 

+ W(X + uL? — vL?) 
1+ AW l 
°+ ue” + y) 
ywm +t Y. 
taw + aw? 

to be bac ta ha REN 
if <0.It oF 8) is Plottéd in Figure. 2 for a 
re BMW ‘Sl Coefficients.’ In order to estimate 
ctive). Th 

chosen are such as to make the func- 
dure, Mars brium, comparable to Hausman’s 
e area to Supply example. The equilibrium 
Figure 1 -15, and m = 8,236 is L = 1,790.2. 


ch yield this are: a = 1,500.0, 8 = 
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the welfare cost of a wage reduction, brought 
about by a tax rate change, it is straightforward 
to integrate (8) with respect to w to obtain the 
answer. For example, suppose the wage rate is 
reduced by 20 percent as a réSult of a tax, then 


the area to the left of L* in Figure 2 between 


(w = 4.15) and (w = 3.32) is a measure of 
the welfare loss. Essentially L5 is approxi- 


‘mating the unknown Hicksian labor supply 


function. l 
What makes the Slutsky demand tractable is 


`- that we do not have to integrate a differential 


equation of the form (6) with a nonlinear term 
in m present. It is the quadrature of the labor 
supply equation in m which makes the inte- 
gration to a cost function complex. We should 
emphasize that it is possible to obtain a cost 
and indirect utility function for the quadratic 
function given above by numerical methods, 
as shown by Hausman (1981 Appendix). 
These functions are given, respectively, by 


(Bt+uw)v? 1Wi+aws 


(9) 1) aaa aia UW + aW, 
and 
hW, — Wi 
1 nyse e, 
(10) u(w, m) hÝ, 
where h = —um + v(ß + vw)?/2, and the 


W’s are obtained by means of infinite expan- 
sions involving the coefficients in (5)-and 
prices. These expansions are defined in, for ex- . 
ample, Milton Abramowitz and Irene Stegun 
(1970 p.-686). Accordingly, the CV or EV can 
be computed by using the expenditure func- 
tion, evaluated at different values of the price - 
vector by.using a computer algorithm as de- 
scribed by Hausman. Of course we cannot de- 
rive a Hicksian labor supply by the application 
of Shephard’s lemma’to (9) without the use of 


numerical methods. 
`~ The conclusion we draw from this is that the 


exact recoverability of the cost and utility 
functions in this limited case of a simple quad- 
ratic function with only one price change 1s 


lS 5 Fi E E E E 
—0.01, \ = —90.0, u = —.000002, y = ~1,596.5, v= 
—0.0313. This set of values satisfies the integrability 
conditions. 
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500 1000 


FIGURE.2. THE QUADRATIC LABOR SUPPLY EXAMPLE 


not at all trivial and cannot be obtained exactly 
without infinite expansions. Hausman has 
been quite clear on this, though he has fre- 
quently been misinterpreted. The fact that the 
expenditure and utility functions can be writ- 
ten as above should not mask the fact that in- 
finite expansions underlie the coefficients, 
meaning that closed-form analytic solutions do 
not support (9) and (10). 

Correspondingly, if we wished to impose a 
cubic form on the data, or wanted to change 
more than one price in the system, or wanted 
to use expansions of finite order, the recover- 
ability of the cost and utility functions be- 
comes yet more complex or less’ exact. 

This then leads to the question of how good 
different methods are at approximating the ex- 
act measure. To test the exactness of the 
Slutsky demand or supply equation in com- 
puting the CV or EV we take two simple ex- 
amples in which the analytic cost functions, 
and therefore an exact welfare measure, are 
available. 


















1500 2000 


B. Accuracy 


Consider first the linear labor supply fest 
tion used by Hausman (1980, 1981): : es 


(11) L=aw+ ém+s. 


(12) 


v(w, m) 


— pôw a + s a ae ; 
= e m + 5 Ww 6 6? s 
where e is the exponential operator. 
sulting Hicksian supply function L 


a 
LË =— + e`% u, 


(13) > + eT, 


i DF 
where u can be either the initial (4 
(u') level of utility attained. Fin 
Slutsky labor supply, L5, is obt 






















g for m in (11) using the relation 
_ wL? 
- w(a — ÔL’) + ôc? + s. 


nY * 


~ get of values given in eae 
a: 8 the true CV is found to be 
1990, 1981) 
lü demand is —$1,238.3, an error 
Se ir of 1 percent. In contrast, the 
qusing the Marshallian function is 

5:4, an error of 6 percent.” 
a sécond example we computed the CV 
2 tated with a 50-percent increase in p, in 
=o avample underlying Figure 1, given in 
r ani We found the Slutsky error was 
i73 percent, the error associated with using 
jaz tangency to the Hicksian demand was 
4 percent, and the error associated with 


labor supply 


ee -» Inthis section, we compare the Slutsky 
i: - eaifaze approximation with the other two one- 
spproximations. First the theoretical re- 
y keGéaship between the errors from measuring 

£V-with the Slutsky and Marshallian demands 
jped in a manner which does not de- 
vthe functional forms of the demands. 
ing this we compare the Slutsky mea- 
ith ‘that based on the second-order ap- 
to the expenditure function. In 


t at $8,236, w° is 4.15, w! is 3.32. The esti- 
ofS are a = 495.1, 6 = —0.125, s = 765.1. 
yield utility of u” = ~27,387.2, and u’ = 


Fes 2© Of historical interest to note that Hausman’s orig- 
jet Be tthe CV incorporates a numerical error. His 
o e CV is —$2,056 and on the basis of this he 
neor of 44 percent to the Marshallian estimate. 
a measure of the true welfare loss, the 
1s not as faulty as Hausman proposes 
against using Willig’s approach is not 
evertheless, as a measure of the pure 
Of the CV, the Marshallian-based mea- 


P 
J operator 3 g 
nction L 


aa cantly from the true measure. Using 
“ 1) gasoline example, Irvine and Sims 

ere (is e Marshallian demand-based estimate 
amitt “ -An error in excess of 22 percent, whereas 
ained. F SUE results in an error of less than 1.1 


65 and the estimate of the CV based ` 
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addition, the theoretical path independence of 
the Slutsky approach is investigated. 


A. A Willig-Type Result 


Robert Willig (1976) derived a measure 


_ of the error associated with the use of a 


Marshallian, rather than a Hicksian, demand 
to estimate the CV or EV. Willig’s result can 
be illustrated, following Robin Boadway and 
Neil Bruce (1984 p. 218), with the use of 
Figure 1. A fall in price from p° to p’ yields 
a welfare gain, as measured by the CV, equal 
to the area p°adp’. The Marshallian approxi- 
mation to this is p°acp’, yielding an error acd. 
Geometrically this can be approximated by 
'hdcAp™ and an expression for this is easily 
derived: dc is the income effect associated 
with an income change of CV (=p°adp’). 
That is, dc =~ Ox/OyAy, where Ay = CV. 
Accordingly the Marshallian error is given by 


| ae 3 
(15) eae SWAR: 


where 7 is the income elasticity of demand. 
The error associated with using the Slutsky 
demand is given by adf. As before we can 
derive an expression for the distance df: The 
movement from c to f is attributable to the 
Slutsky compensation—in this case negative. 
The Slutsky compensation is defined by x°Ap 
(=p°aep') in the case of a single good. It 
therefore follows that the Slutsky income ef- 
fect, df, is due simply to an increase in income 
defined by the area aed. Accordingly the error 
associated with using the Slutsky demand is 


1 
(16) adf ~ 273 (aed) Ap. 


The area aed can be interpreted as a gain 
from the elimination of a deadweight loss if 
the price fall.resulted from eliminating a tax 
equal to Ap. Referring to aed as the dead- 
weight loss, DWL, we therefore have the result 
that the ratio of the Slutsky error to the 
Marshallian error is approximately DWL/CV. 


© Making triangular assumptions is equivalent to using 
the linear terms of a Taylor-series expansion. 
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A corresponding result can be derived for a 
price increase or for the EV. The magnitude 
of the ratio, for a given initial consumption, 
depends upon the price elasticity of the 
Hicksian demand. Specifically, the smaller its 
price elasticity the greater is the error from us- 
ing the Marshallian demand. In the limit, if the 
good in question is perfectly complementary 
with the aggregate of other goods, the Slutsky 
and Hicksian demands coincide and DWL, as 
measured by the Hicksian demand, tends to 
zero. $ 

While these results are remarkably straight- 
forward, they illustrate that there is no need to 
use Marshallian demands to estimate CV or 
EV, when the Hicksian demand cannot be ob- 
tained. If the Marshallian demand can be in- 
tegrated, so can the Slutsky demand, and it 
yields an estimate of the welfare change which 
is an order of magnitude smaller than that 
which comes from the Marshallian demand. 


B. The Second-Order Approximation to an 
Expenditure Function 


The other simple one-step approximation to 
welfare changes considered uses a second- 
order approximation to an expenditure func- 
tion at initial prices. This, in essence, involves 
using the linear function, Z, in Figure 1 as an 
approximation to the Hicksian demand. There 
are a number of reasons to expect this measure 
to underperform the Slutsky approximation. '! 
Mas-Colell et al. (1995 p. 90) point out that 
in cases where the price change is large, it is 
impossible to be sure that the linear Hicksian 
approximation is better even than the 
Marshallian consumer surplus approximation. 
This problem arises because as prices change 
nothing guarantees that this measure will be 
sensitive to demand behavior. This is obvi- 


' In the specific cases that we have explored, the Cobb- 
Douglas utility function (see Section II, subsection B), 
` the indirect addilog from McKenzie and Pearce (1976), 
and the case of quasi-linear preferences, the Slutsky mea- 
sure dominated the linear Hicksian approximation, for 


large and small price changes. In cases in whith the‘ 


Marshallian demand is linear, such as for example 
Hausman’s (1981) gasoline example and his labor supply 
function (see Section H, subsection B), the linear 
Hicksian and Slutsky approximations are identical. 


































ously not so with respect to the mi 
on either the Marshallian or Slutsky dă 
And of course, as was shown in thei 

section, the Slutsky measure perf 
than the Marshallian measure, glob; 


C. Path Independence 


An important property of Hick ® 
mands, which is of special concern wh: 
termining the welfare effects of my] 
changes, is path independence, 
acteristic of demand systems requ 
metric cross-partial price derivative 
Op; = ðxf'/ðp; . Slutsky demands po 
property locally. That is, at the co 
price vector p°, Ox? /Op; = Ox$/ap 
the functions x" and x° are tangent! 
path independence thus holds only lö 
should be noted that Marshallian. ¢ 
generally do not have this prope 
locally. x 


IV. Conclusion 


Two final points should be noted, The £ 
is that standard errors for our measures of 4 
or EV can be obtained readily. This:is 
from equation (8), since the standard é gi 
the coefficients a; are known from theiré& 
metric estimates. i 

The second point is motivated by Hausa 
and Newey’s recent contribution (19%: 
They propose that a nonparametric sp 
tion for the price and income portion ¢ 4 
mand function will generally yiel 
reliable estimates than the impositi 
pler forms. They develop a method: 
mating both a value for welfare chang 
associated standard error. If we 
Slutsky demand as a tool of analys 
polynomial of any order in both p 
income can be fitted to the data. An 
of welfare change can then be derived by! i 


12 An example approximating the CV 
multiple price changes, using the Slutsky 
tions, is provided by Irvine and Sims (1995) ayy 

13 From the equality of their first deriva’, 
follows that the linear Taylor-series expanst” 
are equal. 






























‘on m = px” in the case of a demand 
orm = c° — wL’ in the labor supply 
“This can be obtained by proceeding as in 
: Ta labor supply example given 
zeagardless. of the order of the polyno- 
in estimation, provided the integra- 


We ds by emphasizing that the pur- 
is paper is to propose an intuitive, 

onally simple and accurate measure 
re change in cases where estimated de- 
and supply functions do not yield an eas- 
goverable cost or utility function. We 
go claim that our approach is superior to 
= f'the iterative numerical methods pro- 
zd in the literature in recent years. How- 


derivativa » 

ands possem 

t the condita 
= ôx’ opi iee 


ure the true CV or EV yields exceedingly 
il errors for relatively large price changes. 
reason for this accuracy is that the 
emand obviously provides an ex- 
proximation to the Hicksian de- 


rium. It is this tangency which yields 
Exal'path independence for the Slutsky 


which make the Marshallian demand 
suitable for welfare analysis do not con- 


vector ee px’ = px, 


“eee 


c eA is conditioned, and u is 
Si~concave utility function. The 
utsky demand takes the general 


= x (p, x°). 
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It is also clear that the following identity holds: 


(A3) xf (p,x°) =x¥ (p. Zp. 


= (p.o(p 22) ) f 


where v is the indirect utility function, x” 
is the Marshallian demand, and x? is the ` 


Hicksian demand. Differentiating (A3) with 


respect to p;, and applying Roy’s identity, 


axs 
p ek eee 
Sat Op; Op; 


where x is the demand for x; when y = 
>; pix? and Ox! /Op; is the slope of the 
Hicksian demand, x”, conditioned on 
u(x°), at any p. 

Two results follow from this when com- 
paring the Hicksian demand conditioned on 
u(x?) and the Slutsky demand conditioned 
on x°: 


(i) When prices and income are such that the 
quantity actually demanded (x? ) is equal 
to the bundle on which the Slutsky de- 
mand is conditioned (x?), then the 
Hicksian and the Slutsky demands have 
the same slope. That is, they are tangent 
as is shown in Figure 1. 

When prices and income are such that 
x” > x?, then the Hicksian demand is 
steeper than the Slutsky demand at x, 
assuming that x; is a normal good. Con- 
versely, when prices and income are such 
that x” < x°, then the Hicksian demand 
is flatter than the Slutsky demand at x¥ , 
assuming that x; is a normal good. 


(ii) 


This demonstrates that any particular 
Hicksian demand is an envelope of Slutsky de- 
mands, since this result can be demonstrated 
for any arbitrary bundle, x’, that corresponds 
to a point on the Hicksian demand conditioned 
on u(x°). 

If (A3) is differentiated with respect to p;, 
the resulting equation demonstrates the local 
path independence property discussed in 
Section IN, subsection C. : 
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